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CONSENSUS/ANCESTRAL IMMUNOGENS 

This application claims priority from Prov. 
Appln. No. 60/503,460, filed September 17, 2003, and 
Prov. Appln. No. 60/604,722, filed August 27, 2004, 
5 the entire contents of which are incorporated herein 
by reference . 

TECHNICAL FIELD 

The present invention relates, in general, to 
an immunogen and, in particular, to an immunogen for 

10 inducing antibodies that neutralize a wide spectrum 
of HIV primary isolates and/or to an immunogen that 
induces a T cell immune response . The invention 
also relates to a method of inducing anti-HIV 
antibodies, and/or to a method of inducing a T cell 

15 immune response, using such an immunogen. The 

invention further relates to nucleic acid sequences 
encoding the present immunogens . 

BACKGROUND 

The high level of genetic variability of HIV- 1 
20 has presented a major hurdle for AIDS vaccine 

development. Genetic differences among HIV-1 groups 
M, N, and O are extensive, ranging from 30% to 50% 
in ga.g and env genes, respectively (Gurtler et al, 
J. Virol. 68:1581-1585 (1994), Vanden Haesevelde et 
25 al, J. Virol. 68:1586-1596 (1994), Simon et al , Nat. 
Med. 4:1032-1037 (1998), Kuiken et al , Human 



1 



WO 2005/028625 PCT/US2O04/030397 



retroviruses and AIDS 2000: a compilation and 
analysis of nucleic acid and amino acid sequences 
(Theoretical Biology and Biophysics Group, Los 
Alamos National Laboratory, Los Alamos, New 
5 Mexico) ) . Viruses within group M are further 

classified into nine genetically distinct subtypes 
(A-D, F-H, J and K) (Kuiken et al , Human 
retroviruses and AIDS 2000: a compilation and 
analysis of nucleic acid and amino acid sequences 

10 (Theoretical Biology and Biophysics Group, Los 

Alamos National Laboratory, Los Alamos, New Mexico, 
Robertson et al , Science 288:55-56 (2000), Robertson 
et al, Human retroviruses and AIDS 1999: a 
compilation and analysis of nucleic acid and amino 

15 acid sequences, eds. Kuiken et al (Theoretical 

Biology and Biophysics Group, Los Alamos National 
Laboratory, Los Alamos, New Mexico), pp. 492-505 
(2000)) . With the genetic variation as high as 30% 
in env genes among HIV-1 subtypes, it has been 

20 difficult to consistently elicit cross-subtype T and 
B cell immune responses against all HIV-1 subtypes. 
HIV-1 also frequently recombines among different 
subtypes to create circulating recombinant forms 
(CRFs) (Robertson et al , Science 288:55-56 (2000), 

25 Robertson et al , Human retroviruses and AIDS 1999: a 
compilation and analysis of nucleic acid and amino 
acid sequences, eds. Kuiken et al (Theoretical 
Biology and Biophysics Group, Los Alamos National 
Laboratory, Los Alamos, New Mexico) , pp. 492-505 

30 (2000), Carr et al , Human retroviruses and AIDS 

1998: a compilation and analysis of nucleic acid and 
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amino acid sequences, eds . Korber et al (Theoretical 
Biology and Biophysics Group, Los Alamos National 

Laboratory, Los Alamos, New Mexico) , pp. III-10-III- 

f 

19 (1998)). Over 20% of HIV-l isolates are 

5 recombinant in geographic areas where multiple 
subtypes are common (Robertson et al, Nature 
374:124-126 (1995), Cornelissen et al , J. virol . 
70:8209-8212 (1996), Dowling et al , AIDS 16:1809- 
1820 (2002)), and high prevalence rates of 

io recombinant viruses may further complicate the 
design of experimental HIV-l immunogens . 

To overcome these challenges in AIDS vaccine 
development, three computer models (consensus, 
ancestor and center of the tree) have been used to 

15 generate centralized HIV-l genes to (Gaschen et al, 
Science 296:2354-2360 (2002), Gao et al , Science 
299:1517-1518 (2003), Nickle et al , Science 
299:1515-1517 (2003), Novitsky et al , J. Virol. 
76:5435-5451 (2002), Ellenberger et al , Virology 

20 302 : 155-163 (2002) , Korber et al , Science 288:1789- 
1796 (2000)). The biology of HIV gives rise to 
star-like phylogenies, and as a consequence of this, 
the three kinds of sequences differ from each other 
by 2 - 5% (Gao et al , Science 299:1517-1518 (2003)). 

25 Any of the three centralized gene strategies will 

reduce the protein distances between immunogens and 
field virus strains. Consensus sequences minimize 
the degree of sequence dissimilarity between a 
vaccine strain and contemporary circulating viruses 

30 by creating artificial sequences based on the most 
common amino acid in each position in an alignment 



3 



WO 2005/028625 



PCT/US2O04/030397 



10 



(Gaschen et al, Science .296 : 2354-2360 (2002)). 
Ancestral sequences are similar to consensus 
sequences but are generated using maximum- likelihood 
phylogenetic analysis methods (Gaschen et al , 
Science 296:2354-2360 (2002), Nickle et al , Science 
299:1515-1517 (2003)) . In doing so, this method 
recreates the hypothetical ancestral genes of the 
analyzed current wild-type sequences (Figure 26) . 
Nickle et al proposed another method to generate 
centralized HIV-1 sequences, center of the tree 
(COT) , that is similar to ancestral sequences but 
less influenced by outliers (Science 299 : 1515-1517 
(2003) ) . 

The present invention results, at least in 
15 part, from the results of studies designed to 

determine if centralized immunogens can induce both 
T and B cell immune responses in animals. These 
studies involved the generation of an artificial 
group M consensus env gene (C0N6) , and construction 
20 of DNA plasmids and recombinant vaccinia viruses to 
express C0N6 envelopes as soluble g P 12 0 and gpl4 0CF 
proteins. The results demonstrate that CON6 Env 
proteins are biologically functional, possess 
linear, conformational and glycan-dependent epitopes 
of wild- type HIV-1,. and induce cytokine -producing T 
cells that recognize T cell epitopes of both HIV 
subtypes B and C. Importantly, C0N6 gpl20 and 
gpl40CF proteins induce antibodies that neutralize 
subsets of subtype B and C HIV-1 primary isolates. 

The iterative nature of study of the 
centralized HIV-1 gene approach is derived from the 
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rapidly expanding evolution of HIV-1 sequences, and 
the fact that sequences collected in the HIV 
sequence database (that is, the Los Alamos National 
Database) are continually being updated with new 
5 sequences each year. The CON6 gpl2 0 envelope gene 
derives from Year 1999 Los Alamos National Database 
sequences, and Con-S derives from Year 2 00 0 Los 
Alamos National Database sequences. In addition, 
CON6 has Chinese subtype C VI , V2 , V4 , and V5 Env 

10 sequences, while Con-S has all group M consensus Env 
constant and variable regions, that have been 
shortened to minimal -length variable loops. Codon- 
optimized genes for a series of Year 2003 group M 
and subtype consensus sequences have been designed, 

.15 as have a corresponding series of wild- type HIV-1. 
Env genes for comparison, for use in inducing 
broadly reactive T and B cell responses to HIV-1 
primary isolates. 

SUMMARY OF THE INVENTION 

20 The present invention relates to an immunogen 

for inducing antibodies that neutralize a wide 
spectrum of HIV primary isolates and/or to an 
immunogen that induces a T cell immune response, and 
to nucleic acid sequences encoding same. The 

25 invention also relates to a method of inducing anti- 
HIV antibodies, and/or to a method of inducing a T 
cell immune response, using such an immunogen. 

Objects and advantages of the present invention 
will be clear from the description that follows. 
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BR IEF DESCRIPTION OF THE DRAWINGS 

Figures 1A-1D: Generation and expression of 
the group M consensus env gene (CON6) . The complete 
amino acid sequence of CON6 gp!60 is shown. 
5 (Fig. 1A) The five regions from the' wild- type 
CRF0 8JBC (98CN006) env gene are indicated by 
underlined letters. Variable regions are indicated 
by brackets above the sequences. Potential N-liked 
glycosylation sites are highlighted with bold-faced 

10 letters. (Fig. IB) Constructs of CON6 gpl20 and 
gpl4 0CF. CONS gpl2 0 and gpl4 0CF plasmids were 
engineered by introducing a stop codon after the 
gpl2 0 cleavage site or before the transmembrane 
domain, respectively. The gpl2 0/gp41 cleavage site 

15 and fusion domain of gp41 were deleted in the 

gpl4 0CF protein. (Fig.lC) Expression of CON6 gpl2 0 
and gpl4 0CF. CON6 gpl2 0 and gpl4 0CF were purified 
from the cell culture supernatants of rW- infected 
2 93T cells with galsmthus Nivalis argrarose lectin 

20 columns. Both gpl20 and gpl40CF were separated on a 
10% SDS-polyarylamide gel and stained with Commassie 
blue. (Fig. ID.) CONS env gene optimized based on 
codon usage for highly expressed human genes. 

Figures 2A-2E. Binding of CON6 gpl2 0 gp!4 0 CF 
25 to soluble CD4 (sCD4) and anti-Env mAbs . (Figs. 2A- 
2B) Each of the indicated mabs and sCD4 was 
covalently immobilized to a CM 5 sensor chip 
(BIAcore) and CONG gpl20 (Fig. 2A) or gpl40CF (Fig. 
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2B) (10 0 /xg/ml and 300 fig/ml, respectively) were 
injected over each surface. Both gpl2 0 and gp!4 0CF 
proteins reacted with each anti-gpl20 mabs tested 
except forl7b mab, which showed negligible binding 
5 to both CONG gp!2 0 and gpl4 0CF. To determine 
induction of 17b mab binding to CONG gpl2 0 and 
gpl40CF / CONG gpl20 (Fig. 2C) or gpl40CF (Fig. 2D) 
proteins were captured (400-580 RU) on individual 
flow cells immobilized with sCD4 or mabs A32 or T8 . 

10 Following stabilisation of each of the surface, mAb 
17b was injected and flowed over each of the 
immobilized flow cells. Overlay of curves show tbat 
the binding of mab 17b to CONG Env proteins was 
markedly enhanced on both sCD4 and mab A32 surfaces 

15 but not on the T8 surface (Figs. 2C-2D) . To 

determine binding of CONG gpl2 0 and gpl4 0CF to human 
mabs in ELISA, stock solutions of 20p.g/ml of mabs 
447, F39F, A32, IgGlbl2 and 2F5 on CONG gpl20 and 
gpl40CF were tittered (Fig. 2E) . Mabs 447 (V3), 

20 F39F (V3) A32 (gp!20) and IgGlbl2 (CD4 binding site) 
each bound to both CONG gpl2 0 and 14 0 well, while 
2F5 (anti-gp41 ELDKWAS) only bound gpl4 0CF. The 
concentration at endpoint titer on gpl20 for mab 447 
and F3 9F binding was <0.0 03 £tg/ml and 0.00G fig /ml , 

25 respectively; for mab A32 was <0.125 /xg/ml; for 
IgGlbl2 was <0.002 /xg/ml ; and for 2F5 was 0.01G 
/xg/ml . 

Figures 3A and 3B. Infectivity and coreceptor 
usage of CONG envelope. (Fig. 3A) CONG and control 
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env plasmids were cotransf ected with HIV-l/SG3Aenv 
backbone into human 293T cells to generate Env- 
pseudovirions . Equal amounts of each pseudovirion 
(5 ng p24) were used to infect JC53-BL cells. The 
5 infect ivity was determined by counting the number of 
blue cells (infectious units, IU) per microgram of 
p24 of pseudovirons {113/ fig p24) after staining the 
infected cells for P~gal expression. (Fig. 3B) 
Coreceptor usage of the C0N6 env gene was determined 

10 on JC53BL cells treated with AMD3100 and/or TAK-79 9 
for 1 hr (3 7°C) then infected with equal amounts of 
p24 (5 ng) of each Env-pseudovirion . Infect ivity in 
the control group (no blocking agent) was set as 
100%. Blocking efficiency was expressed as the 

15 percentage of IU from blocking experiments compared 
to those from control cultures without blocking 
agents. Data shown are mean ± SD . 

Figure 4. Western blot analysis of multiple 
subtype Env proteins against multiple subtype 

20 antisera. Equal amount of Env proteins (100 ng) 
were separated on 10% SDS-polyacrylamide gels. 
Following electrophoresis, proteins were transferred 
to Hybond ECL nitrocellulose membranes and reacted 
with sera from HIV-l infected patients (1:1,000) or 

2 5 guinea pigs immunized with CON6 gpl2 0 DNA prime, rW 
boost (1:1,000). Protein-bound antibody was probed 
with fluorescent -labeled secondary antibodies and 
the images scanned and recorded on an infrared 
imager Odyssey (Li -Cor, Lincoln, NE) . Subtypes are 
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indicated by single-letters after Env protein and 
serum IDs. Four to six sera were tested for each 
subtype, and reaction patterns were similar among 
all sera from the same subtype. One representative 
5 result for each subtype serum is shown. 

Figure 5. T cell immune responses induced by 
CON6 Env immunogens in mice. Splenocytes were 
isolated from individual immunized mice (5 
mice/group) . After splenocytes were stimulated in 
vitro with overlapping Env peptide pools of CON6 
(black column) , subtype B (hatched column) , subtype 
C (white column) , and medium (no peptide; gray 
column) , INF- y producing cells were determined by 
the ELISPOT assay. T cell IFN-y responses induced 
by either CON 6 gpl2 0 or gpl4 0CF were compared to 
those induced by subtype specific Env immunogens 
(JRFL and 96ZM651) . Total responses for each 
envelope peptide pool are expressed as SFCs per 
million splenocytes.' The values for each column are 
the mean ± SEMjof IFN-y SFCs (n=5 mi ce /group ) . 

Figures 6A-6E. Construction of codon usage 
optimized subtype C ancestral and consensus envelope 
genes (Figs. 6A and 6B, respectively) . Ancestral 
and consensus amino acid sequences (Figs. 6C and 6D, 
25 respectively) were transcribed to mirror the codon. 
usage of highly expressed human genes. Paired 
oligonucleotides (80-mers) overlapping by 2 0 bp were 
designed to contain 5' invariant sequences including 
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the restriction enzyme sites EcoRI , Bbsl, Bam HI and 
BsmBI. Bbsl and BsmBI are Type II restriction 
enzymes that cleave outside of their recognition 
sequences. Paired oligomers were linked 
5 individually using PCR and primers complimentary to 
the 18 bp invariant sequences in a stepwise fashion, 
yielding 140bp PCR products. These were subcloned 
into pGEM-T and sequenced to confirm the absence of 
inadvertant mutations/deletions. Four individual 

10 pGEM-T subclones containing the proper inserts were 
digested and ligated together into pcDNA3 . 1 . Mult de- 
fragment ligations occurred repeatly amongst groups 
of fragments in a stepwise manner from the 5' to the 
3' end of the gene until the entire gene was 

15 reconstructed in pcDNA3 . 1 . (See schematic in Fig. 
6E. ) 

Figure 7. JC53-BL cells are a derivative of 
HeLa cells that express high levels of CD4 and the 
HIV-1 coreceptors CCR5 and CXCR4 . They also contain 

20 the reporter cassettes of lucif erase and p~ 

galactosidase that are each expressed from an HIV-1 
LTR. Expression of the reporter genes is dependent 
on production of HIV-1 Tat. Briefly, cells are 
seeded into 24 or 96-well plates, incubated at 37°C 

25 for 24 hours and treated with DEAE-Dextran at 37 °C 
for 30 minutes. Virus is serially diluted in 1% 
DMEM, added to the cells incubating in DEAE-Dextran, 
and allowed to incubate for 3 hours at 37 °C after 
which an additional cell media is added to each 
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well. Following a final 4 8 -hour incubation at 37°C, 
cells are either fixed, stained using X-Gal to 
visualize p-galactosidase expressing blue foci or 
frozen-thawed three times to measure luciferase 
activity. 

Figure 8. Sequence alignment of subtype C 
ancestral and consensus env genes. Alignment of the 
subtype C ancestral (bottom line) and consensus (top 
line) env sequences showing a 95.5% sequence 
homology; amino acid sequence' differences are 
indicated. One noted difference is the addition of a 

glycosylation site in the C ancestral env gene at 

i 

the base of the VI loop. A plus sign indicates a 
within-class difference of amino acid. at the 
indicated position; a bar indicates a change in. the 
class of amino acid. Potential N-glycosylation sites 
are marked in blue. The position of truncation for 
the gpl4 0 gene is also shown. 

Figure 9. Expression 1 of subtype C ancestral 
o and consensus envelopes in 293T cells. Plasmids 
containing codon-optimized gpl60 f gpl40, or gpl20 
subtype C ancestral and consensus genes were 
transfected into 293T cells, and protein expression 
was examined by Western Blot analysis of cell 
5 lysates. 48 -hours post-transf ection, cell lysates 
were collected, total protein content determined by 
the BCA protein assay, and 2 jig of total protein was 
loaded per lane on a 4-20% SDS-PAGE gel. Proteins 
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were transferred to a PVDF membrane and probed with 
HIV-1 plasma from a subtype C infected patient. 

Figures 10A and 10B. Fig. 10A. Trans 
complementation of env-def icient HIV-1 with codon- 
5 optimized subtype C ancestral and consensus gpl6 0 
and gpl40. Plasmids containing codon-optimized, 
subtype C ancestral or consensus gpl60 or gpl40 
genes were co- transf ected into 293T cells with an 
HIV-l/SG3Aenv provirus . 48 hours post-transf ection 
10 cell supernatants containing pseudotyped virus were 
harvested, clarified by centrif ugation, filtered 
through at 0 .2\M filter, and pelleted through a 20% 
sucrose cushion. Quantification of p24 in each 
virus pellet was determined using the Coulter HIV-1 
15 p24 antigen assay; 25ng of p24 was loaded per lane 
on a 4-20% SDS-PAGE gel for particles containing a. 
codon-optimized envelope. 250ng of p24 was loaded 
per lane for particles generated by co-transf ection 
of a rev-dependent wild-type subtype C 96ZAM651ez2\^ 
20 gene. Differences in the amount of p24 loaded per 
lane were necessary to ensure visualization of the 
rev-dependent envelopes by Western Blot. Proteins 
were transferred to a PVDF membrane and probed with 
pooled plasma from HIV-1 subtype B and subtype C 
25 infected individuals. Fig. 10B. Infectivity of 

virus particles containing subtype C ancestral and 
consensus envelope glycoproteins. Infectivity of 
pseudotyped virus containing ancestral or consensus 
gpl60 or gpl40 envelope was determined using the 



12 



WO 2005/028625 



PCT/US2004/030397 



JC53-BL assay. Sucrose cushion purified virus 
particles were assayed by the Coulter p24 antigen 
assay, and 5-fold serial dilutions of each pellet 
were incubated with DEAE-Dextran treated JC53-BL 

5 cells. Following a 48 -hour incubation period, cells 
were fixed and stained to visualize (3-galactosidase 
expressing, cells. Infectivity is represented as 
infectious units per ng of p24 to normalize for 
differences in the concentration of the input 

io pseudovirions . 

Figure 11. Co-receptor usage of subtype C 
ancestral and consensus envelopes. Pseudotyped 
particles containing ancestral or consensus envelope 
were incubated with DEAE-Dextran treated JC53-BL 

15 cells in the presence of AMD3100 (a specific 

inhibitor of CXCR4 ) , TAK779 (a specific inhibitor of 

CCR5 ) , or AMD3 0 0 0+TAK77 9 to determine co- receptor 

usage. NL4 . 3 , an isolate known to utilize CXCR4 , 

and YU-2, a known" CCR5 -using isolate,- were included 

20 as controls. 



Figures 12A-12C. Neutralization sensitivity of 
subtype C ancestral and consensus envelope 
glycoproteins. Equivalent amounts of pseudovirions 
containing the ancestral, consensus or 96ZAM651 
gplGO envelopes (1,500 infectious units) were pre- 
incubated with a panel of plasma samples from HIV-1 
subtype C infected patients and then added to the 
JC53-BL cell monolayer in 96-well plates. Plates 
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were cultured for two days and luciferase activity 
was measured as an indicator of viral infect ivity. 
Virus infectivity is calculated by dividing the 
luciferase units (LU) produced at each concentration 
of antibody by the LU produced by the control 
infection. The mean 50% inhibitory concentration 
(IC 5 o) and the actual % neutralization at each 
antibody dilution are then calculated for each 
virus. The results of all luciferase experiments 
are confirmed by direct counting of blue foci in 
parallel infections. 

Figures 13A-13F. Protein expression of 
consensus subtype C Gag (Fig. 13 A) and Nef (Fig. 
13B) following transfection into 293T cells. 
Consensus subtype C Gag and Nef amino acid sequences 
are set forth in Figs. 13C and 13D, respectively, 
and encoding sequences are set forth in Figs. 13E 
and 13F, respectively. 

Figures 14A-14C. Figs. 14A and 14B show the 
0 Con-S Env amino acid sequence and encoding sequence, 
respectively. Fig. 14C shows expression of Group M 
consensus Con-S Env proteins using an in vitro 
transcription and translation system. 

Figures 15A and 15B. Expression of Con-S env 
5 gene in mammalian cells. (Fig. 15A - cell lysate, 
Fig. 15B - supernatant.) 
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Figures ISA and 16B. Infect ivity (Fig. 16A) 
and coreceptor usage (Fig. 16B) of CON6 and Con-S 
env genes . 

Figures 17A-17C. Env protein incorporation in 
5 CON 6 and Con-S Env-pseudovirions . (Fig. 17A - 

lysate, Fig. 17B - supernatant, Fig. 17C pellet.) 

Figures 18A-18D. Figs. 18A and 18B show 
subtype A consensus Env amino acid sequence and 
nucleic acid sequence encoding same, respectively. 
10 Figs. 18C and 18D show expression of A. con env gene 
in mammalian cells (Fig. 18C - cell lysate, Fig. 18D 
- supernatant) . 

Figures 19A-19H. M. con. gag (Fig. 19A) , 
M.con.pol (Fig. 19B) , M.con.nef (Fig. 19C) and 
15 C.con.pol (Fig. 19D) nucleic acid sequences and 

corresponding encoded amino acid sequences (Figs. 
19E-19H, respectively) . 

Figures 20A-20D. Subtype B consensus gag (Fig. 
2 OA) and env (Fig.20B) genes. Corresponding amino 
20 acid sequences are shown in Figs. 2 0C and 2 0D. 

Figure 21. Expression of subtype B consensus 
env and gag genes in 293T cells. Plasmids 
containing codon-opt imized subtype B consensus 
gpl60, gpl40 r and gag genes were transfected into 
25 2 93T cells, and protein expression was examined by 
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Western Blot analysis of cell lysates. 48-hours 
post-transf ection, cell lysates were collected, 
total protein content determined by the BCA protein 
assay, and 2 fig of total protein was loaded per lane 
5 on a 4-20% SDS-PAGE gel. Proteins were transferred 
to a PVDF membrane and probed with serum from an 
HIV-1 subtype B infected individual. 

Figure 22 . Co-receptor usage of subtype B 
consensus envelopes. Pseudotyped particles 
containing the subtype B consensus gpl6 0 Env were 
incubated with DEAE-Dextran treated JC53-BL cells in 
the presence of AMD3100 (a specific inhibitor of 
CXCR4) , TAK77 9 (a specific inhibitor of CCR5) , and 
AMD3 0 0 0+TAK77 9 to determine co-receptor usage. 
NL4.3, an isolate known to utilize CXCR4 and YU-2, a 
known CCR5 -using isolate, were included as controls. 

Figures 23A and 23B. Trans complementation of 
env-def icient HIV-1 with codon-optimized subtype B 
consensus gp!60 and gpl40 genes. Plasmids 
20 containing codon-optimized, subtype B consensus 

gpl60 or gp!40 genes were co- transf ected into 293T 
cells with an HIV-l/SG3Aenv provirus . 48-hours 
post-transf ection cell supernatants containing 
pseudotyped virus were harvested, clarified in a 
25 tabletop centrifuge, filtered through a 0.2/xM 

filter, and pellet through a 2 0% sucrose cushion. 
Quantification of p24 in each virus pellet was 
determined using the Coulter HIV-1 p24 antigen 
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assay; 25 ng of p24 was loaded per lane on a 4-20% 
SDS-PAGE gel. Proteins were transferred to a PVDF . 
membrane and probed with ant i -HIV- 1 antibodies from 
infected HIV-1 subtype B patient serum. Trans 
5 complementation with a rev-dependent NL4 . 3 env was 
included, for control. Figure 23B. Infectivity of 
virus particles containing the subtype B concensus 
envelope. Infectivitiy of pseudotyped virus 
containing consensus B gpl60 or gpl40 was determined 

10 using the JC53-BL assay. Sucrose cushion purified 
virus particles were assayed by the Coulter p24 
antigen assay, and 5- fold serial dilutions of each 
pellet were incubated with DEAE-Dextran treated 
JC53-BL cells. Following a 48-hour incubation 

15 period, cells were fixed and stained to visualize (3- 
galactosidase expressing cells. Infectivity is 
expressed as infectious units per ng of p24. 

Figures 24A-24D. Neutralization sensitivity of 
■ virions containing subtype B consensus- gpl60 

20 envelope. Equivalent amounts of pseudovirions 
containing the subtype B consensus or NL4 . 3 Env 
(gpl60) (1,500 infectious units) were preincubated 
with three different monoclonal neutralizing 
antibodies and a panel of plasma samples from HIV-1 

25 wubtype B infected individuals, and then added to 
the JC53-BL cell monolayer in 96-well plates. 
Plates were cultured for two days and luciferase 
activity was measured as an indicator of viral 
infectivity. Virus infectivity was calculated by 
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dividing the luciferase units (LU) produced at each 
concentration of antibody by the LU produced by the 
control infection. The mean 50% inhibitory 
concentration (IC 50 ) and the actual % neutralization 
at each antibody dilution were then calculated for 
each virus. The results of all luciferase 
experiments were confirmed by direct counting of 
blue foci in parallel infections. Fig. 2 4A. 
Neutralization of Pseudovirions containing Subtype B 
consensus Env (gplGO) . Fig. 24B. Neutralization of 
Pseudovirions containing NL4 . 3 Env (gplGO) . 
Fig. 24C. Neutralization of Pseudovirions containing 
Subtype B consensus Env (gplGO) . Fig. 24D. 
Neutralization of Pseudovirions containing NL4 . 3 Env 
(gplSO) . 

Figures 25A and 25B. Fig. 25A. Density and p24 
analysis of sucrose gradient fractions. 0.5ml 
fractions were collected from a 20-60% sucrose 
gradient. Fraction number 1 represents the most 
dense fraction taken from the bottom of the gradient 
tube. Density was measured with a ref ractometer and 
the amount of p2 4 in each fraction was determined by 
the Coulter p24 antigen assay. Fractions 6-9, 10- 
15, 16-21, and 22-25 were pooled together and 
5 analyzed by Western Blot. As expected, virions 
sedimented at a density of 1.16-1.18 g/ml. 
Fig. 25B. VLP production by co- transf ection of 
subtype B consensus gag and env genes. 2 93T cells 
were co- transf ected with subtype B consensus gag and 
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env genes. Cell supernatants were harvested 48- 
hours post-transfection, clarified through at 20% 
sucrose cushion, and further purified through a 20- 
60% sucrose gradient. Select fractions from the 
gradient were pooled, added to 2 0ml of PBS, and 
centrifuged overnight at 100,000 x g. Resuspended 
pellets were loaded onto a 4-20% SDS-PAGE gel, 
proteins were transferred to a PVDF membrane, and 
probed with plasma from an HIV-1 subtype B infected 
individual . 

Figures 2 6A and 2 6B. Fig. 2 6A. 2 000 Con-S 
140CFI.ENV. Fig. 26B. Codon-opt imized Year 2000 
Con-S 14 0CFI.seq. 

Figure 27. Individual C57BL/6 mouse T cell 
responses to HIV-1 envelope peptides. Comparative 
immunogenicity of C0N6 gp!4 0CFI and Con-S gpl4 0CFI 
in C57BL/C mice. Mice were immunized with either 
HIV53 0 5 (Subtype A) , 2 801 (Subtype B) , -CON-6-or Con-S 
Envelope genes in DNA prime, rW boost regimens, 5 
mice" per group. Spleen cells were assayed for IFN-y 
spot-forming cells 10 days after rW boost, using 
mixtures of overlapping peptides from Envs of HIV-1 
UG37(A), MN(B), Chl9 (C) , 89.6(B) SF162 (B) or no 
peptide negative control. 

Figures 28A-28C. Fig. 28A. Con-B 2003 Env. pep 
(841 a. a.). Amino acid sequence underlined is the 
fusion domain that is deleted in 14 0CF design and 
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the "W" underlined is the last 'amino acid at the 
C-terminus , all amino acids after the "W" are 
f deleted in the 140CF design. Fig. 28B. Con-B- 
140CF.pep (632 a. a.). Amino acids in bold identify 
5 the junction of the deleted fusion cleavage site. 
Fig. 28C. Codon-opt imized Con-B 140CF.seq 
(1927 nt . ) . 

Figures 29A-29C. Fig. 29A. CON_OF_CONS - 2 0 0 3 
.(829 a. a.) . Amino acid sequence underlined is the 

10 fusion domain that is deleted in 14 0CF design and 
the »W» underlined is the last amino acid at the 
C-terminus, all amino acids after the "W" are 
deleted in the 140CF design. Fig. 29B. ConS-2003 
140CF.pep (620 a. a.). Amino acids in bold identify 

is the junction of the deleted fusion cleavage site. 

Fig. 29C. CODON-OPTIMIZED ConS-2003 140CF.seq (18 91 
nt . ) . 



Figures 30A-30C. Fig. 30A. CONSENSUS_A1-2 00 3 
(845 a. a.) . Amino acid sequence underlined is the 
20 fusion domain that is deleted in 140CF design and 
the "W" underlined is the last amino acid at the 
C-terminus, all amino acids after the »W" are 
deleted in the 140CF design. Fig. 30B. Con-Al-2003 
140CF.pep (629 a. a.). Amino acids in bold identify 
25 the junction of the deleted fusion cleavage site. 
Fig. 30C. CODON-OPTIMIZED Con-Al-2 003 . seq . 
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Figures 31A-31C. Fig. 31A. CONSENSUS_C-2 0 03 
(835 a. a.). Amino acid sequence underlined is the 
fusion domain that is deleted in 140CF design and. 
the "W" underlined is the last amino acid at the 
5 C-terminus, all amino acids after the "W n are 

deleted in the 140CF design. Fig. 31B. Con-C 2003 
140CF.pep (619 a. a.). Amino acids in bold identify 
the junction of the deleted fusion cleavage site. 
Fig. 31C. CODON-OPTIMIZED Con-C-2003 (140 CF (1,888 
10 nt . ) . 

Figures 3 2A-3 2C. Fig. 3 2A. CONSENSUS_G-2 003 
(842 a. a.). Amino acid sequence underlined is the 
fusion domain that is deleted in 14 0CF design and 
the M W n underlined is the last amino acid at the 
15 C-terminus, all amino acids after the "W M are 

deleted in the 140CF design. Fig. 32B. Con-G-2003 
140CF.pep (626 a. a.). Amino acids in bold identify 
the junction of the deleted fusion cleavage site. 
Fig. 3 2C. CODON-OPTIMIZED Oon-G- 2 0 03 . seq . 

20 Figures 33A-33C. Fig. 33A. CONSENSUS__01_AE- 

2003 (854 a. a.). Amino acid sequence underlined is 
the fusion domain that is deleted in 14 0CF design 
and the ,! W" underlined is the last amino acid at 
the C-terminus, all amino acids after the "W" are 

25 deleted in the 140CF design. Fig. 33B. Con-AEOl- 
2003 140CF.pep (638 a. a.). Amino acids in bold 
identify the junction of the deleted fusion cleavage 
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site. Fig. 33C. CODON- OPTIMIZED Con-AE01-2 0 03 . seq. 
(1945 nt.) . 

Figures 34A-34C. Fig. 34A. Wild-type subtype 
A Env. 00KE_MSA4 076-A (Subtype A, 891 a. a.). Amino 
5 acid sequence underlined is the fusion domain that 
is deleted in 140CF design and the "W" underlined 
is the last amino acid at the C-terminus, all amino 
acids after the "W" are deleted in the 140CF design. 
Fig. 34B. 0 0 KE_MS A4 01 G -A 140CF.pep (647 a. a.). 
io Amino acids in bold identify the junction of the 
deleted fusion cleavage site. Fig. 34C. CODON- 
OPTIMIZED OOKE_MSA40 7 6-A 140CF.seq. (1972 nt.) . 

Figures 35A-35C. Fig. 35A. Wild-type subtype 
E. QH0515.1g gplGO (861 a. a.). Amino acid sequence 

15 underlined is the fusion domain that is deleted in 
140CF design and the "W" underlined is the last 
amino acid at the C-terminus, all amino acids after 
the "W" are deleted in the 140CF design. Fig. 35B. 
QH0515.1g 140CF (651 a. a.). Amino acids in bold 

20 identify the junction of the deleted fusion cleavage 
site. Fig. 35C. CODON-OPTIMIZED QH0515.1g 
140CF.seq (1984 nt . ) . 

Figures 3 6A-3 6C. Fig. 3 6A. Wild- type subtype 
C. DU123.6 gpl60 (854 a.a.}. Amino acid sequence 
25 underlined is the fusion domain that is deleted in 
14 0CF design and the "W" underlined is the last 
amino acid at the C-terminus, all amino acids after 
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the »W" are deleted in the 140CF design. Fig. 36B. 
DU123.6 140CF (638 a. a.). Amino acids in bold 
identify the junction of the deleted fusion cleavage 
site. Fig. 36C. CODON- OPTIMIZED DU123..6 140CF.seq 
5 (1945 nt . ) . 

Figures 37A-37C. Fig. 37A. Wild-type subtype. 
CRF01_AE. 9 7 CNGX2 F - AE (854 a. a.). Amino acid 
sequence underlined is the fusion domain that is 
deleted in 140CF design and the »W" underlined is 

io the last amino acid at the C- terminus, all amino 

acids after the "W" are deleted in the 140CF design. 
Fig. 37B. 97CNGX2F-AE 140CF.pep (629 a. a.). Amino 
acids in bold identify the junction of the deleted 
fusion cleavage site. Fig. 37C. CODON-OPTIMIZED 

15 97CNGX2F-AE 140CF.seq (1921 nt . ) . 

Figures 3 8A-3 8C. Fig. 3 8A. Wild- type DRCBL-G 
(854 a. a.). Amino acid sequence underlined is the 
fusion domain that is deleted in 140CF design and 
the "W" underlined is the last amino acid at the 

20 C- terminus, all amino acids after the "W" are 

deleted in the 140CF design. Fig. 3 8B. DRCBL-G 
140CF.pep (630 a. a.). Amino acids in bold identify 
the junction of the deleted fusion cleavage site. 
Fig. 3 8C. CODON-OPTIMIZED DRCBL-G 14 0CF.seq (1921 

25 nt . ) . 
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Figures 39A and 39B. Fig. 3 9A. 2003 Con-S 
Env. Fig. 3 9B. 2 0 03 Con-S Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) 

Figures 40A and 40B. Fig. 40A. 2003 M. 
5 Group. Anc Env. Fig. 40B. 2003 M. Group . anc 

Env.seq.opt. (Seq.opt. = codon optimized encoding 
sequence . ) 

Figures 41A and 4 IB. Fig. 41A. 2 0 03 CON_Al 
Env. Fig. 41B. 2003 C0N_A1 Env.seq.opt. 
10 (Seq.opt. = codon optimized encoding sequence.) 

Figures 42A and 42B. Fig. 42A. 2003 Al.Anc 
Env. Figs. 42B. 2003 Al . anc Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) 

Figures 4 3A and 43B. Fig. 43A. 2 0 03 CON__A2 
15 Env. Fig. 43B. 2 0 03 C0N__A2 Env.seq.opt. 

(Seq.opt. = codon optimized encoding sequence.) 

Figures 44A and 44B. Fig. 44A. 2 0 03 CON_JB 
Env. Fig. 44B. 2003 C0N_B Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) 

20 Figures 45A and 45B. Fig. 45A. 2003 B . anc 

Env. Figs. 45B. 2003 B . anc Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) 
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Figures 4 6A and 4 6B. Fig. 4 6A. 2 0 03 CON__C 
Env. Fig. 4 6B. 2 0 03 CON_C Env.seq.opt. 
(Seg.opt. = codon optimized encoding sequence.) 

Figures 47A and 47B. Fig. 47A. 2003 C . anc 
5 Env. Fig. 4 7B. 2 0 03 C . anc Env.seq.opt. 

(Seq.opt. = codon optimized encoding sequence.) 

Figures 4 8A and 4 8B. Fig. 4 8A. 2 0 03 C0N_D 
Env. Fig. 4 8B. 2 0 03 CON__D Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) \ 

10 Figures 49A and 49B. Fig. 49A. 2003 C0N_F1 

Env. Fig. 4 9B. 2 0 03 CON_Fl Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) 

Figures 5 OA and SOB. Fig. 5 OA. 2 003 CON_F2 
Env. Fig. 5 0B. 2 0 03 CON_F2 Env.seq.opt. 
15 (Seq.opt.. = codon optimized encoding sequence.) 

Figures 51A and 5 IB. Fig. 51A. 2 0 03 CON_G 
Env. Fig. 51B. 2 0 03 C0N_G Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) 

Figures 52A and 52B. Fig. 52A. 2003 C0N_H 
20 Env. Fig. 52B. 2003 CON_H Env.seq.opt. 

(Seq.opt. = codon optimized encoding sequence.) 
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Figures 53A and 53B. Fig. 53A. 2003 CON_01_AE 
Env. Fig. 53B. 2003 CON_01__AE Env.seq.opt . 
(Seq.opt. = codon optimized encoding sequence.) 

Figures 54A and 54B . Fig. -54A. 2003 CON_02_AG 
5 Env. Fig. 54B . 2 003 CON__02_AG Env.seq.opt. 

(Seq.opt. = codon optimized encoding sequence.) 

Figures 55A and 5 5B. Fig. 5 5A. 2 0 03 CON_03__AB 
Env. Fig. 55B. 2003 CON_03_AB Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) 

10 Figures 56A and 56B. Fig. 56A. 2003 

CON_04_CPX Env. Fig. 56B. 2003 CON_04_CPX 
Env.seq.opt. (Seq.opt. = codon optimized encoding 
sequence . ) 

Figures 57A and 5 7B. Fig. 5 7A. 2 003 
15 CON_0 6_CPX Env. Fig. 5 7B. 2 0 03 CON_06_CPX 

Env.seq.opt. (Seq.opt. = codon optimized encoding 
sequence . ) 

Figures 58A and 58B. Fig. 5 8A. 2 003 CON__0 8_JBC 
Env. . Fig. 58B. 1 2003 CON_08_BC Env.seq.opt. 
20 (Seq.opt. = codon optimized encoding sequence.) 

Figures 5 9A and 5 9B. Fig. 5 9A. 2 0 03 CON_10_CD 
Env. Fig. 5 9B. 2 0 03 CON_10_CD Env.seq.opt. 
(Seq.qpt. = codon optimized encoding sequence.) 
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Figures 60A and 60B.. Fig. 60A. 2003 
CON_ll_CPX Env. Fig. 6 OB. 2 0 03 CON__ll_CPX 
Env.seq.opt. (Seq.opt. = codon optimized encoding 
sequence . ) 

Figures 61A and 61B. Fig. 61A. 2003 CON_12_BF 
Env. Fig. 61B. 2003 CON_12_BF Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) 

Figures 62A and 62B. Fig. 62A. 2003 CON_14__BG 
Env. Fig. 62B. 2 0 03 CON_14_BG Env.seq.opt. 
(Seq.opt. = codon optimized encoding sequence.) 

Figures 63A and 63B. Fig. 63A. 2 003_CON_S 
gag . PEP . Fig. 63B. 2 0"03_CON_S gag . OPT . 
(OPT = codon optimized encoding sequence.) 



Figures 64A and 64B. 
15 2 00 3JVI.GROUP. anc gag. PEP. 
2 003_M.GROUP.anc gag. OPT. 
encoding sequence . ) 



Fig. 64A. 
Fig. 64B. 

(OPT = codon optimized 



Figures 65A-65D. Fig. 65A. 2003_CON_A1 
gag. PEP. Fig. 65B. 2 0 03_CON_A1 gag. OPT. Fig. 65 C. 
20 2003_Al.anc gag. PEP. Fig. 65D. 2003_Al.anc 

gag. OPT. (OPT = codon optimized encoding sequence.) 



27 



WO 2005/028625 

PCT7US2OO4/030397 



Figures 66A and 66B. Fig. 66A. 2003 CON A2 
gag . PEP . Fig. 6 6B. 2 0 03_CON_A2 gag. OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 67A-6 7D.. Fig. 67 A. 2 0 03_CON_B 
5 gag. PEP. Fig. 6 7B. 2 0 03_CON_B gag. OPT. Fig. 67C . 
2003_B.anc gag. PEP. Fig. 67D. 2003_B.anc gag . OPT . 
(OPT = codon optimized encoding sequence.) 

Figures 6 8A-68D. Fig. 6 8A. 2 003_CON_C 
gag. PEP. Fig. 68B. 2003_CON__C gag . OPT Fig. 68G. 
10 2003_C.anc.gag.PEP. Fig. 68D. 2 0 03_C . anc . gag . OPT . 
(OPT = codon optimized encoding sequence.) 

Figures 69A and 69B. Fig. 69A. 2 003_CON_D 
gag. PEP. Fig. 6 9B. 2 0 03_CONJD gag . OPT . 
(OPT = codon optimized encoding sequence.) 

15 Figures 70A and 70B. Fig. 70A. 2 0 03_CON_F 

gag. PEP. Fig. 7 0B. 2 0 03_CON_F gag. OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 71A and 71B. Fig. 71A. 2003_CON_G 
gag. PEP. Fig. 7iB. 2 0 03_CON_G gag. OPT. 
20 (OPT = codon optimized encoding sequence.) 

Figures 72A and 72B. Fig. 72A. 2 003_CON_H 
gag. PEP. Fig. 72B. 2 0 03_CON__H gag . OPT . 
(OPT = codon optimized encoding sequence.) 
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Figures 73 A and 7 3B. Fig. 73A. 2 0 03_CON_K 
gag. PEP. Fig. 73B. 2 003_CON_K gag . OPT . 
(OPT - codon optimized encoding sequence.) 

Figures 74A and 74B. Fig. 74A. 2 0 03_CON_01_AE 
5 gag. PEP. Fig. 7B . 20 03_CON_01_AE gag. OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 75A and 75B. Fig. 75A. 2 0 03_CON_02_AG 
gag. PEP. Fig. 75B. 2 0 03_CON_02_AG gag . OPT . 
(OPT = codon optimized encoding sequence.) 

io Figures 76A and 76B. Fig. 76A. 

2 003_CON_03_ABG gag . PEP . Fig. 76B. 2 0 03_CON_03_ABG 
gag. OPT. (OPT = codon optimized encoding sequence.) 

Figures 77A and 77B. Fig. 77A. 
2 003_CON_04_CFX gag . PEP . Fig. 7 7B. 2 0 03 CON_04_CFX 
is gag. OPT. (OPT = codon optimized encoding sequence.) 

i 

Figures 78A and 78B. Fig. 78A. 
2 0 03_CON_0 6_CPX gag . PEP . ' Fig. 7 8B. 2003_CON_06_CPX 
gag. OPT. (OPT = codon optimized encoding sequence.) 

Figures 79A and 79B. Fig. 79A. 2 003_CON_0 7_BC 
20 gag. PEP. Fig. 79B. 2 0 03_CON_07_BC gag . OPT . 
(OPT = codon optimized encoding sequence.) 
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Figures 80A and 80B. Fig. 80A. 2 0 03_CON_0 8_BC 
gag. PEP. Fig. 8 OB. 2 0 03_CON_0 8_BC gag . OPT . 
(OPT = codon optimized encoding sequence.) 

, Figures 81A and 81B. Fig. 81A. 2 0 03_CON_10_CD 
5 gag . PEP . Fig. 8 IB. 2 0 03_CON__10_CD gag. OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 82A and 82B. Fig. 82A. 
2 0 03_CON_11_CPX gag. PEP. Fig. 82B. 2 003_CON_11_CPX 
gag. OPT. (OPT = codon optimized encoding sequence.) 

io Figures 83A and 83B. Fig. 83A. 

2 0 03_CON_12_BF.gag.PEP. Fig. 83B. 

2 003_CON_12_BF.gag.OPT. (OPT = codon optimized 
encoding sequence . ) 

Figures 84A and 84B. Fig. 84A. 2 0 03_CON_14_BG 
15 gag. PEP. Fig. 84B. 2 0 03_CON_14_BG gag . OPT . 
(OPT = codon opt imized encoding sequence.) 

Figures 85A and 85B. Fig. 85A. 2003_CONS 
nef.PEP. Fig. 85B. 2003_CONS nef.OPT. 
(OPT = codon optimized encoding sequence.) 

20 Figures 86A and 86B. Fig. 86A. 2003_M 

GROUP. anc nef.PEP. Fig. 8 6B. 2 0 03_M 

GROUP. anc. nef.OPT. (OPT = codon optimized encoding 
sequence . ) 
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Figures 87A and 8 7B. Fig. 8 7A. 2 003_CON_A 
nef.PEP. Fig. 8 7B. 2 0 03_CON_A nef .OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 88A-88D. Fig. 8 8A. 2 0 03_CON_A1 
5 nef.PEP. Fig. 88B. 2 003_CON_A1 nef. OPT. Fig. 88C. 
2003_Al.anc nef.PEP. Fig. 88D. 2003_Al.anc 
nef .OPT. (OPT = codon optimized encoding sequence.) 

Figures 8 9A and 8 9B. Fig. 8 9A. 2 0 03_CON_A2 
nef.PEP. Fig. 8 9B. 2 003_CON_A2 nef. OPT. 
10 (OPT = codon optimized encoding sequence.) 

Figures 90A-90D. Fig. 90A. 2003_CON_B 
nef.PEP. Fig. 90B. 2003_CON-B nef. OPT. Fig. 90C . 
2003_B.anc nef.PEP. Fig. 90D. 2003_B.anc nef. OPT. 
(OPT = codon optimized encoding sequence.) 

_ 15_ _.. Figures _91A and 91B. Fig. 91A. 2 0 03__CON_02_AG 
nef.PEP. Fig. 91B. 2 0 03_CON_02_AG nef .OPT. 
(OPT = codon optimized. encoding sequence.) 

Figures 92A-92D. Fig. 92A. 2 0 03_CON_C 
nef.PEP. Fig. 92B. 2 0 03_CON_C nef. OPT. Fig. 92C . 
20 2003_C.anc nef.PEP. Fig. 92D. 2003_C.anc nef. OPT. 
(OPT = codon optimized encoding sequence.) 
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Figures 93A and 93B. Fig. 93A. 2 0 03_CON__D 
nef.PEP. Fig. 93B. 2 003_CON__D nef .OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 94A and 94B. Fig. 94A. 2 0 03_CON_F1 
5 nef.PEP. Fig. 94B . 2 0*03_CONJF1 nef. OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 95A and 95B. Fig. 95A. • 2003_CON_F2 
nef.PEP. Fig. 95B. 2 0 03,_CON_F2 nef. OPT. 
(OPT = codon optimized encoding sequence.) 

10 Figures 96A and 96B. Fig. 96A. 2003_CON__G 

nef.PEP. Fig. 96B. 2 003_CON_G nef .OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 97A and 97B. Fig. 97A. 2 0 03_CON_H 
nef.PEP. Fig. 9 7B. 2 0 03_CON__H nef .OPT. 
15 (OPT = codon optimized encoding sequence.) 

Figures 98A and 9 8B. Fig. 9 8A. 2 00 3_CON_01_AE 
nef.PEP. Fig. 98B. 2 003_CON_01_AE nef. OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 99A and 99B. Fig. 99A. 2 003_CON_03_AE 
20 nef.PEP. Fig. 99B. 2 0 03_CON_03_AE nef. OPT. 
(OPT = codon optimized encoding sequence.) 
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Figures 100A and 100B. Fig. 100A. 
2 0 03_CON_04_CFX nef.PEP. Fig. 100B. 
2 003__CON_04_CFX nef.OPT. (OPT = codon optimized 
encoding sequence . ) 

5 Figures 101A and 101B. Fig. 101A. 

2 00.3_CON_0 6_CFX nef.PEP. Fig. 10 IB. 
2 00 3_CON_0 6_CFX nef.OPT. (OPT = codon optimized 
encoding sequence . ) 

Figures 102A and 102B. Fig. 102A. 
10 2 003_CON_0 8_BC nef.PEP. Fig. 102B. 2 0 03_CON_0 8_BC 
nef.OPT. (OPT = codon optimized encoding sequence.) 

Figures 103A and 103B. Fig. 103A. 
2 003_CON_10_CD nef.PEP. Fig. 103B. 2 0 03_CON_10_CD 
nef.OPT. (OPT = codon optimized encoding sequence.) 

15 Figures 104A and 104B. Fig. 104A. 

2 003_CON_11_CFX nef.PEP. Fig. 104B. 
2 003_CON_11_CFX nef.OPT. (OPT = codon optimized 
encoding sequence . ) 

Figures 105A and 105B. Fig. 105A. 
20 2003_CON_12_BF nef.PEP. Fig. 105B. 2 003_CON_12_BF 
nef.OPT. (OPT = codon optimized encoding sequence.) 
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Figures 10 6A and 10 6B. Fig. 106A. 
2 0 03_CON_14_BG nef . PEP . Fig. 106B. 2003_CON_14_BG 
nef.OPT. (OPT = codon optimized encoding sequence,) 

Figures 10 7A and 10 7B. Fig. 107A. 2 0 03_CON_S 
5 pol.PEP. Fig. 10 7B. 2003_CON_S pol.OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 108A and 108B. Fig. 108A. 2003_M 
GROUP anc pol.PEP. Fig. 10 8B. 2003_3YLGROUP anc 
pol.OPT. (OPT = codon optimized encoding sequence.) 

10 Figures 109A-109D. Fig. 10 9A. 2003_CON_A1 

pol.PEP. Fig. 109B. 2 0 03_CON_A1 pol . OPT . 
Fig. 109C. 2003_Al.anc pol.PEP. Fig. 109D. 
2 003_Al.anc pol.OPT. (OPT = codon optimized 
encoding sequence . ) 

15 Figures 110A and HOB. Fig. 110A. 2003__CON_A2 

pol.PEP. Fig. 11 OB. 2 00 3_CON__A2 pol.OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 111A-111D. Fig. 111A. 2003_CON_B 
pol.PEP. Fig. 111B. 2 003_CON_B pol.OPT. Fig. 
20 111C. 2003_B.anc pol.PEP. Fig. HID. 2003_B.anc 

pol.OPT. (OPT = codon optimized encoding sequence.) 

Figures 112A-112D. Fig. 112A. 2003_CON_C 
pol.PEP. Fig. 112B. 2 003_CON_C pol . OPT . 
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Fig. 112C. 2003_C.anc pol . PEP . Fig. 112D. 

2 003_C.anc pol. OPT. (OPT = codon optimized encoding 

sequence . ) 

Figures 113A and 113B. Fig. 113A. 2 0 03_CON_D 
5 pol. PEP. Fig. 113B. 2 0 03_CON_D pol. OPT. 
(OPT = codon optimized encoding sequence.) 

Figures 114 A and 114B. Fig. 114A. 2003_CON_F1 
pol . PEP . Fig. 114B. 2 003_CON_F1 pol . OPT . 
(OPT = codon optimized encoding sequence.) 

lo Figures 115A and 115B. Fig. 115A. 2 003_CON_F2 

pol . PEP . Fig. 115B. 2003_CON_F2 pol . OPT . 
(OPT = codon optimized encoding sequence.) 

Figures 116A and 116B. Fig. 116A. 2003_CON_G 
pol. PEP. Fig. 116B. 2 003_CON_G pol. OPT. 
15 (OPT _= codon optimized encoding sequence.) 

Figures 117A and 117B. Fig. 117A. 2003_CON_H 
pol. PEP. Fig. 117B. 2003_CON_H pol .OPT . 
(OPT = codon optimized encoding sequence.) 

Figures 118A and 118B. Fig. 118A. 
20 2 003_CON_01_AE pol . PEP . Fig. 118B. 2 0 03_CON_01_AE 
pol. OPT. (OPT = codon optimized encoding sequence.) 
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Figures 119A and 119B. Fig. 119A. 
2 003_CON_02_AG pel . PEP . Fig. 119B. 2 0 03_CON_02_AG 
pol.OPT. (OPT = codon optimized encoding sequence.) 

Figures 12 OA and 12 OB. Fig. 12 OA. 
5 2 00 3_CON_0 3_AB pol .PEP. Fig. 120B. 2 0 03_CON_03_AB 
pol.OPT. (OPT = codon optimized encoding sequence.) 

Figures 12 1A and 121B. Fig. 121A. 
2 00 3_CON_04_CPX pol . PEP . Fig. 12 IB. 
2 00 3_CON_04_CPX pol.OPT. (OPT = codon optimized 
10 encoding sequence . ) 

Figures 122A and 122B. Fig. 122A. 
2 00 3_CON_06_CPX pol . PEP . Fig. 122B. 
2 00 3_CON_0 6_CPX pol.OPT. (OPT = codon optimized 
encoding sequence . ) 

15 Figures 123A and 123B. Fig. 123A. 

2 00 3_CON_0 8__BC pol .PEP. Fig. 123B. 2 003_CON_08_BC 
pol.OPT. (OPT = codon optimized encoding sequence.) 

Figures 124A and 124B. Fig. 124A. 
2 00 3_CON_10_CD pol . PEP. Fig. 124B. 2 0 0 3_CON_l 0_CD 
20 pol.OPT. (OPT = codon optimized encoding sequence.) 

Figures 125A and 125B. Fig. 125A. 
2 00 3_CON_11_CPX pol. PEP. Fig. 125B. 
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2 003_ CON_ll_CPX pol.OPT. (OPT = codon optimized 
encoding sequence . ) 

Figures 126A and 126B. Fig. 126A. 
2 0 0 3_COM__l 2_BF pol . PEP . Fig. 126B. 2 003_CON_12_BF 
5 pol.OPT. (OPT = codon optimized encoding sequence.) 

Figures 127A and 127B. Fig. 127A. 
2 003_CON_14J3G pol . PEP . Fig. 127B. 2 0 03_CON_14_BG 
pol.OPT. (OPT = codon optimized encoding sequence.) 

DETAILED DESCRIPTION OF THE INVENTION 

10 The present invention relates to an immunogen 

that induces antibodies that neutralize a wide 
spectrum of human immunodeficiency virus (HIV) 
primary isolates and/or that induces a T cell 
response. The immunogen comprises at least one 

is consensus or ancestral immunogen (e.g., Env, Gag, 
Nef or Pol ) , or portion - or variant -thereof . The 
invention also relates to nucleic acid sequences 
encoding the consensus or ancestral immunogen, or 
portion or variant thereof . The invention further 

20 relates to methods of using both the immunogen and 
the encoding sequences. While the invention is 
described in detail with reference to specific 
consensus and ancestral immunogens (for example, to 
a group M consensus Env) , it will be appreciated 

25 that the approach described herein can be used to 
generate a variety of consensus or ancestral 
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immunogens (for example, envelopes for other HIV-1 
groups (e.g., N and O) ) . 

In accordance with one embodiment of the 
invention, a consensus env gene can be constructed 
5 by generating consensus sequences of env genes for 
each subtype of a particular HIV-1 group (group M 
being classified into subtypes A-D, F-H, J an K) , 
for example, from sequences in the Los Alamos HIV 
Sequence Database (using, for example, MASE 
10 (Multiple Aligned Sequence Editor) ) . A consensus 
sequence of all subtype consensuses can then be 
generated to avoid heavily sequenced subtypes 
(Gaschen et al , Science 296:2354-2360 (2002), Korber 
et al, Science 288 : 1789-1796 - (2000) ) . In the case 
15 of the group M consensus env gene described in 

Example 1 (designated C0N6) , five highly variable 
regions from a CRF08_BC recombinant strain (98CN006) 
(VI, V2 , V4 , V5 and a region in cytoplasmic domain 
of gp41) are used to fill in the missing regions in 
20 the sequence (see, however, corresponding regions 
for Con-S) . For high levels of expression, the 
codons of consensus or ancestral genes can be 
optimized based on codon usage for highly expressed 
human genes (Haas et al , Curr. Biol. 6:315-324 
25 (2000), Andre et al, J. Virol. 72:1497-1503 (1998)). 

With the Year 1999 consensus group M env gene, 
C0N6, it has been possible to demonstrate induction 
of superior T cell responses by CON6 versus wild- 
type B and C env by the number of ELI SPOT 
30 Y" inter "f eron spleen spot forming cells and the 
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number of epitopes recognized in two strains of mice 
(Tables 1 and 2 show the data in BALB/c mice) . The 
ability of C0N6 Env protein to induce neutralizing 
antibodies to HIV-1 primary isolates has been 
5 compared to that of several subtype B Env. The 
target of neutralizing antibodies induced by. CONG 
includes several non-B HIV-1 strains. 



Table 1. T cell epitope mapping of CON6, JRFL and 96ZM651 
Env immunogen in BALB/c mice 



Immunogen T ceIl 

Peptide C0NB JRFL (B) g6ZM651 (C) response 



CON 6 (group M consensus) 










16 


DTEVHNVWATHACVP . 


+ 






CD4 


48 
49 


KNSSEYYRLINCNTS 

EYYRLINCNTSAITQ 


+ 




+ 


CD4 


53 
54 

62 


CPKVSFEPIPIHYCA 

SFEPIPIHYCAPAGF 

NVSTVQCTHGIKPW 








CD4 
CD4 


104 
105 


ETITLPCRIKOHNM 

LPCRIKQIINMWQGV 


+ 






CD8 


130 
131 


GIVQQQSNLLRAIEA 

VQOSNLLRAIEAQQHL 


+ 






CD4 


134 
135 


AQQHLLQLTVWG EKQLQ 

LQLTVWGIKQLQARVI. 


+ 






CD4 


Subtype B (MN) 










6223 
6224 


AKAYDTEVHNVWATO 

DTEVHNVWATQACVP 


+ 






CD4 


. 6261 
6262 


ACPKISFEPIPIHYC 

ISFEPIPIHYCAPAG 


+ 






CD4 


62B6 
6287 


RKRIHIGPGRAFYTT 
HIGPGRAFYTTKNII 




+ 




CD8 


6346 
6347 


IVQQQNNLLRAIEAQ 

QNNLLRAIEAQQHMt 








CD4 


Subtype C (Chn19) 








CD4 


4B34 


VPVWKEAKTTLFCASDAKSY 






+ 


4B36 


GKEVHNVWATHACVPTDPNP 


+ 




+ 


CD4 


4846 


SSENSSEYYRUNCNTSAtT 


+ 






CD4 


4854 


STVOCTHGIKPWSTQLLLN 


+ 






CD4 


4884 
4885 


QQSNLLRAIEAQQHLLQLTV 
AQQHLLQLTVWGIKQLQTRV 


-f 
+ 






CD4 
CD4 
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Table 2. T cell epitope mapping of CON6.gpl20 
immunogen in C5 7BL/6 mice 



Peptide 



Peptide sequence 



T cell response 



CON 6 (consensus) 
2 
3 

16 
53 
97 
99 



GIQRWCQHLWRWGTM 

NCQHLWRWGTMILGM 

DTEVHNWATHACVP 

CPKVSFEPIPIHYCA 

FYCNTSGLFNSTWMF 

FNS TWMFNGT YMFNG 



CD 8 

CD 4 
CD 4 
CD 8 
CD 8 



Subtype B (MN) 
6210 
6211 

6232 

6262 

6290 
6291 



G IRJRNYQHWWG WGTM 

NYQHWWGWGTMLLGL 

NMW KNNMVE QMHED 1 

1SFEPIP IHYCAPAG 

N1IGTIRQAHCNISR 

TIRQAHCNISRAKWN 



CD 8 

CD 4 
CD 4 
CD 4 



Subtype C (Chn 19) 
4830 

5446 

4836 

4862 

4888 



MRVTG IRKNYQHLWRWGTML 
RWGTMLLGMLMI CS AAEN 
G KEVHNVWATHACVPTD PiSTP 
GD I RQAHCN I S KDKWNETLQ 
LLGIWGCSGKLICTTTVPWN 



CD 8 
CDS 
CD 4 
CD4 
CD 8 



For the Year 200 0 consensus group M erzv gene, 
.5" Con-S, the Con-S envelope has been shown to be as 
immunogenic as the CONG envelope gene in T cell y 
interferon ELISPOT assays in two strains of mice 
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(the data for C57BL/6 are shown in Fig. 27) . 
Furthermore, in comparing CON6 and Con-S gpl4 0 Envs 
as protein immunogens for antibody in guinea pigs 

(Table 3) , both gpl4 0 Envs were found to induce 
5 antibodies that neutralized subtype B primary 

isolates. However, Con-S gpl40 also induced robust 
neutralization of the subtype C isolates TV-1 and DU 
123 as well as one subtype A HIV-1 primary isolate, 
while CON6 did not. 
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As the next iteration of consensus immunogens , 
and in recognition of the fact that a practical HIV- 

I immunogen can be a pol^alent mixture of either 
5 several subtype consensus genes, a mixture of 

subtype and consensus genes, or a mixture of 
centralized genes and wild type genes, a series of 

II subtype consensus, and wild type genes have been 
designed from subtypes A, B, C, CRF AE01, and G as 

10 well as a group M consensus gene from Year 2 003 Los 
Alamos National Database sequences. The wild type 
sequences were chosen either because they were known 
to come from early transmitted HIV-1 strains (those 
strains most likely to be necessary to be protected 

15 against by a vaccine) or because they were the most 
recently submitted strains in the database of that 
subtype. These nucleotide and amino acid sequences 
are shown in Figures 28-38 (for all 140CF designs 
shown, 140CF gene can be flanked with the 5* 

20 sequence 11 TTCAGTCGACGGCCACC 11 that contains a Kozak 
sequence ( GCCACCATGG/ A) and Sail site and 3 1 
sequence of T AAAG AT C T TAC AA. containing stop codon and 
Bgrlll site) . Shown in Figures 39-62 are 2003 
centralized (consensus and ancestral) HIV-1 envelope 

25 proteins and the codon optimized gene sequences. 

Major differences between C0N6 gpl40 (which 
does not neutralize non-clade B HIV strains) and 
Con-S gpl4 0 (which does induce antibodies that 
neutralize non-clade B HIV strains) are in Con-S VI, 

3 0 V2, V4 and V5 regions. For clade B strains, 

peptides of the V3 region can induce neutralizing 
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antibodies (Haynes et al, J. Immunol. 151 : 164 6*165 3 
(1993)). Thus, construction of Th-Vl , Th-V2 , Th- 
V4, Th-V5 peptides can be expected to give rise to 
the desired broadly reactive anti-non-clade B 
5 neutralizing antibodies. Therefore, the Th-V 

peptides set forth in Table 4 are contemplated for 
use as a peptide immunogen(s) derived from Con-S 
gpl40. The gag Th determinant (GTH, Table 4) or any 
homologous GTH sequence in other HIV strains, can be 

10 used to promote immunogenicity and the C4 region of 
HIV gpl20 can be used as well ( KQ I INMWQ WGKAMYA ) or 
any homologous C4 sequence from other HIV strains 
(Haynes et al, J. Immunol. 151:1646-1653 (1993)). 
Con-S VI, V2 , V4 , V5 peptides with an N-terminal 

15 helper determinant can be used singly or together, 
when formulated in a suitable adjuvant such as 
Corixa ' s RC52 9 (Baldridge et al, J. Endotoxin Res. 
8:453-458 (2002)), to induce broadly cross reactive 
neutralising antibodies to non-clade B isolates. 
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Table 4 








1) 


GTH Con-S VI 132-150 


YKRWIILGLNKIVRMYTNVNVTNTTNNTEEKGEIKN 


2) 


GTH Con-S V2 157-189 


YKRWIILGLNKIVRMYTBRDKKQKVYALFYRLDWPIDDNNNNSSNYR 


3) 


GTH Con-S V3 294-315 


YKRWIILGLNKIVRMYTRPNNNTRKSIRIGPGQAFYAT 


4) 


GTH Con-S V4 381-408 


YKRWIILGLNKIVRMYNTSGLFNSTWIGNGTKNNNNTNDTETLP 


5) 


GTH Con-S V5 447-466 


YKRWIILGLNKIVRMYRDGGNNMTNETEIFRPGGGD 




GTH Con-6 VI 132-150 


YKRWIILGLNKIVRMYNVRNVSSNGTETDNEBKN 


7) 


GTH Con-6 V2 157-196 


YKRWIILGLNKIVRMYTELRDKKQKVYALFYRLDWPIDDKNSSEISGKNSSEYYR 


8) 


GTH-Con6 V3 301-322 


YKRWIILGLNKIVRMYTRPNNNTRKSIHIGPGQAFYAT 


9) 


GTH Con-6 V4 388-418 


YKRWIILGLNKIVRMYNTSGLFNSTWMFNGTYMFNGTKDNSETTTLP 


10 


GTH Con 6 V5 457-477 


YKRWIILGLNKIVRMYRDGGNNSNKNKTETFRPGGGD 



It will be appreciated that the invention 
includes portions and variants of the sequences 
specifically disclosed herein. For example, forms 
5 of codon optimized consensus encoding sequences can 
be constructed as gpl40CF, gpl40 CFI , gpl20 or gplGO 
forms with either gpl20/41 cleaved or uncleaved. 
For example, and as regards the consensus and 
ancestral envelope sequences, the invention 

10 encompasses envelope sequences devoid of V3 . 

Alternatively, V3 sequences can be selected from 
preferred sequences, for example, those described in 
U.S. Application No. 10/431,596 and U.S. Provisional 
Application No. 60/471,327. In addition, an optimal 

is immunogen for breadth of response can include 

mixtures of group M consensus gragr, pol, nef and ejnv 
encoding sequences, and as well as consist of 
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mixtures of subtype consensus or ancestral encoding 
sequences for gagr, pol , nef and env HIV genes. Foot 
dealing with regional differences in virus strains, 
an efficacious mixture can include mixtures of 
5 consensus/ancestral and wild type encoding 
sequences . 

A consensus or* ancestral envelope of the 
invention can be been "activated" to expose 
intermediate conformations of neutralization 

10 epitopes that normally are only transiently or less 
well exposed on the surface of the HIV virion. The 
immunogen can be a "frozen" triggered form of a 
consensus or ancestral envelope that makes available 
specific epitopes for presentation to B lymphocytes. 

15 The result, of this epitope presentation is the 

production of antibodies that broadly neutralize 
HIV. (Attention is directed to WO 02/024149 and to 
the activated/triggered envelopes described 
therein. ) 

20 - The concept of a fusion intermediate immunogen 

is consistent with observations that the gp41 HR-2 
region peptide, DP17 8, can capture an uncoiled 
conformation of gp41 (Furata et al , Nature Struct. 
Biol. 5:276 (1998)), and that formalin- fixed HIV- 

25 infected cells can generate broadly neutralizing 

antibodies (LaCasse et al , Science 283:357 (1997)) . 
Recently a monoclonal antibody against the coiled- 
coil region bound to a conformational determinant of 
gp41 in HR1 and HR2 regions of the coiled- coil gp41 

30 structure, but did not neutralize HIV (Jiang et al , 
J . Virol. 10213 (1998)). However, this latter study 
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proved that the coiled-coil region is available for 
antibody to bind if the correct antibody is 
generated . 

The immunogen of one aspect of the invention 
5 comprises a consensus or ancestral envelope either 
in soluble form or anchored, for example, in cell 
vesicles or in liposomes containing translipid 
bilayer envelope. To make a more native envelope, 
gpl4 0 or gp!60 consensus or ancestral sequences can 

10 be configured in lipid bilayers for native trimeric 
envelope formation. Alternatively, triggered gpl6 0 
in aldrithio 1-2 inactivated HIV-1 virions can be 
used as an immunogen. The gpl60 can also exist as a 
recombinant protein either as gpl60 or gpl40 (gpl40 

15 is gpl60 with the transmembrane region and possibly 
other gp41 regions deleted) . Bound to gpl60 or 
gpl4 0 can be recombinant CCR5 or CXCR4 co-receptor 
proteins (or their extracellular domain peptide or 
protein fragments) or antibodies or other ligands 

20 that bind to the CXCR4 or CCR5 binding site on 

gpl20, and/or soluble CD4 , or antibodies or other 
ligands that mimic the -binding actions of CD4 . 
Alternatively, vesicles or liposomes containing CD4 , 
CCR5 (or CXCR4) , or soluble CD4 and peptides 

25 reflective of CCR5 or CXCR4 gpl20 binding sites. 

Alternatively, an optimal CCR5 peptide ligand can be 
a peptide from the N- terminus of CCR5 wherein 
specific tyrosines are sulfated (Bormier et al , 
Proc. Natl. Acad. Sci . USA 97:5762 (2001)). The 

30 triggered immunogen may not need to be bound to a 

membrane but may exist and be triggered in solution. 
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Alternatively, soluble CD4 (sCD4) can be replaced by 
an envelope (gpl40 or gpl60) triggered by CD4 
peptide mimetopes (Vitra et al , Proc. Natl. Acad. 
Sci. USA 96:1301 (1999)). Other HIV co-receptor 
5 molecules that "trigger" the gpl60 or gpl40 to 

undergo changes associated with a structure of gplGO 
that induces cell fusion can also be used. Ligation 
of soluble HIV gp!40 primary isolate HIV 89.6 
envelope with soluble CD4 (sCD4) induced 
10 conformational changes in gp41. 

In one embodiment, the invention relates to an 
immunogen that has the characteristics of a receptor 
(CD4) -ligated consensus or ancestral envelope with 
CCR5 binding region exposed but unlike CD4- ligated 
.15 proteins that have the CD4 binding site blocked, 
this immunogen has the CD4 binding site exposed 
(open) . Moreover, this immunogen can be devoid of 
host CD4, which avoids the production of potentially 
harmful anti-CD4 antibodies upon administration to a 
20 host . - ... 

The immunogen can comprise consensus or 
ancestral envelope ligated with a ligand that binds 
to a site on gpl20 recognized by an A32 monoclonal 
antibodies (mab) (Wyatt et al , J. Virol. 69:5723 
25 (1995), Boots et al , AIDS Res. Hum. Retro. 13:1549 
(1997), Moore et al , J. Virol. 68:8350 (1994), 
Sullivan et al , J. Virol. 72:4694 (1998), Fouts et 
al, J. Virol. 71:2779 (1997), Ye et al , J. Virol. 
74:11955 (2000)). One A32 mab has been shown to 
30 mimic CD4 and when bound to gpl20, upregulates 

(exposes) the CCR5 binding site (Wyatt et al , J. 
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Virol. 69:5723 (1995)). Ligation of gpl20 with such 
a ligand also upregulates the CD4 binding site and 
does not block CD4 binding to gpl2 0. 

Advantageously, such ligands also upregulate the HR- 
5 2 binding site of gp41 bound to. cleaved gpl2 0, 

uncleaved gpl4 0 and cleaved gp41, thereby further 
exposing HR-2 binding sites on these proteins - each 
of which are potential targets for ant i -HIV 
neutralizing antibodies. 

io In a specific aspect of this embodiment, the 

immunogen comprises soluble HIV consensus or 
ancestral gpl20 envelope ligated with either an 
intact A3 2 mab, a Fab2 fragment of an A3 2 mab, or a 
Fab fragment of an A32 mab, with the result that the 

15 CD4 binding site, the CCR5 binding site and the HR-2 
binding site on the consensus or ancestral envelope 
are exposed/upregulated. The immunogen can comprise 
consensus or ancestral envelope with an A32 mab (or 
fragment thereof) bound or can. comprise consensus or 

20 ancestral envelope with an A32 mab (or fragment 

thereof) bound and cross-linked with a cross-linker 
such as .3% formaldehyde or a heterobif unctional 
cross-linker such as DTSSP (Pierce Chemical 
Company) . The immunogen can also comprise uncleaved 

25 consensus or ancestral gpl4 0 or a mixture of 

uncleaved gpl40, cleaved gp41 and cleaved gpl20. An 
A3 2 mab (or fragment thereof) bound to consensus or 
ancestral gpl4 0 and/or gpl2 0 or to gpl2 0 non- 
covalently bound to gp41, results in upregulation 

30 (exposure) of HR-2 binding sites in gp41, gpl2 0 and 
uncleaved gpl40. Binding of an A32 mab (or fragment 
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thereof) to gpl20 or gpl40 also results in 
upregulation of the CD4 binding site and the CCR5 
binding site. As with gpl2 0 containing complexes, 
complexes comprising uncleaved gpl4 0 and an A3 2 mab 
5 (or- fragment thereof) can be used as an immunogen 
uncross -linked or cross-linked with cross-linker 
such as .3% formaldehyde or DTSSP. In one 
embodiment, the invention relates to an immunogen 
comprising soluble uncleaved consensus or ancestral 

10 gpl4 0 bound and cross linked to a Fab fragment or 

whole A32 mab, optionally bound and cross-linked to 
an HR-2 binding protein. 

The consensus or ancestral envelope protein 
triggered with a ligand that binds to the A32 mab 

15 binding site on gp!2 0 can be administered in 
combination with at least a second immunogen 
comprising a second envelope, triggered by a ligand 
that binds to a site distinct from the A32 mab 
binding site, such as the CCR5 binding site 

20 recognized by. mab 17b . The 17b mab (Kwong et al ,_ 
Nature 393:648 (1998) available from the AIDS 
Reference Repository, NIAID, NIH) augments sCD4 
binding to gpl20. This second immunogen (which can 
also be used alone or in combination with triggered 

25 immunogens other than that described above) can, for 
example, comprise soluble HIV consensus or ancestral 
envelope ligated with either the whole 17b mab, a 
Fab2 fragment of the 17b mab, or a Fab fragment of 
the 17b mab. It will be appreciated that other CCR5 

30 ligands, including other antibodies (or fragments 

thereof) , that result in the CD4 binding site being 
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exposed can be used in lieu of the 17b mab. This 
further immunogen can comprise gpl2 0 with the 17b 
mab, or fragment thereof, (or other CCR5 ligand as 
indicated above) bound or can comprise gpl2 0 with 
the 17b mab, or fragment thereof, (or other CCR5 
ligand as indicated above) bound and cross-linked 
with an agent such as .3% formaldehyde or a 
heterobifunctional cross-linker, such as DTSSP 
(Pierce Chemical Company) . Alternatively, this 
further immunogen can comprise uncleaved gpl4 0 
present alone or in a mixture of cleaved gp41 and 
cleaved gpl20. Mab 17b, or fragment thereof (or 
other CCR5 ligand as indicated above) bound to gpl4 0 
and/or gpl20 in such a mixture results in exposure 
of the CD4 binding region. The 17b mab, or fragment 
thereof, (or other CCR5 ligand as indicated above) 
gpl4 0 complexes can be present uncross -linked or 
cross-linked with an agent such as .3% formaldehyde 
or DTSSP . 

Soluble HR-2 peptides, such as T649Q2 6L and 
DP178, can be added to the above-described complexes 
to stabilize epitopes on consensus gpl20 and gp41 as 
well as uncleaved consensus gpl4 0 molecules, and can 
be administered either cross-linked or uncross- 
5 linked with the complex. 

A series of monoclonal antibodies (mabs) have 
been made that neutralize many HIV primary isolates, 
including, in addition to the 17b mab described 
above, mab IgGlbl2 that binds to the CD4 binding 
0 site on gpl2 0 (Roben et al , J. Virol. 68:482 (1994) , 
Mo et al, J . Virol. 71:6869 (1997)), mab 2G12 that 
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binds to a conformational determinant on gpl2 0 
(Trkola et al, J. Virol. 70:1100 (1996)), and mab 
2F5 that binds to a membrane proximal region of gp41 
(Muster et al, J. Virol. 68:4031 (1994)). 
5 As indicated above, various approaches can be 

used to "freeze" fusogenic epitopes in accordance 
with the invention. For example, "freezing" can be 
effected by addition of the DP-178 or T-649Q26L 
peptides that represent portions of the coiled coil 

10 region, and that when added to CD4 -triggered 
consensus or ancestral envelope, result in 
prevention of fusion (Rimsky et al, J. Virol. 
72:986-993 (1998)). HR-2 peptide bound consensus or 
ancestral gpl20, gpl40, gp41 or gpl60 can be used as 

15 an immunogen or crosslinked by a reagent such as 
DTSSP or DSP (Pierce Co.), formaldehyde or other 
crosslinking agent that has a similar effect. 

"Freezing" can also be effected by the addition 
of 0.1% to 3% formaldehyde or paraformaldehyde, both 

20 protein cross- linking agents, to the ^complex, to 

stabilize the CD4 , CCR5 or CXCR4 , HR-2 peptide gp!60 
complex, or to stabilize the "triggered" gp41 
molecule, or both (LaCasse et al , Science 283:357- 
362 (1999) ) . 

25 Further, "freezing" of consensus or ancestral 

gp41 or gpl20 fusion intermediates can be effected 
by addition of heterobif unct ional agents such as DSP 
(dithiobis [succimidylproprionate] ) (Pierce Co. 
Rockford, ILL., No. 22585ZZ) or the water soluble 

30 DTSSP (Pierce Co.) that use two NHS esters that are 
reactive with amino groups to cross link and 
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stabilize the CD4 , CCR5 or CXCR4 , HR-2 peptide gpl60 
complex, or to stabilize the "triggered" gp41 
molecule, or both. 

Analysis of T cell immune responses in 
5 immunized or vaccinated animals and humans shows 
that the envelope protein is normally not a main 
target for T cell immune response although it is the 
only gene that induces neutralizing antibodies. 
HIV-1 Gag, Pol and Nef proteins induce a potent T 
D cell immune response. Accordingly, the invention 
includes a repertoire of consensus or ancestral 
immunogens that can induce both humoral and cellular 
immune responses . Subujiits of consensus or 
ancestral sequences can be used as T or B cell 
5 immunogens. (See Examples 6 and 7, and Figures 
referenced therein, and Figures 63-127. 

The immunogen of the invention can be 
formulated with a pharmaceutically acceptable 
carrier and/or adjuvant (such as alum) using 
o techniques well known in the art. Suitable routes 
of administration of the present immunogen include 
systemic (e.g. intramuscular or subcutaneous). 
Alternative routes can be used when an immune 
response is sought in a mucosal immune system (e.g., 
5 intranasal) . 

The immunogens of the invention can be 
chemically synthesized and purified using methods 
which are well known to the ordinarily skilled 
artisan. The immunogens can also be synthesized by 
.0 well-known recombinant DNA techniques. Nucleic 

acids encoding the immunogens of the invention can 
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be used as components of, for example, a DNA vaccine 
wherein the encoding sequence is administered as 
naked DNA or, for example, a minigene encoding the 
immunogen can be present in a viral vector. The 
5 encoding sequence can be present, for example, in a 
replicating or non-replicating adenoviral vector, an 
adeno-associated virus vector, an attenuated 
mycobacterium tuberculosis vector, a Bacillus 
Calmette Guerin (BCG) vector, a vaccinia or Modified 
10 Vaccinia Ankara (MVA) vector, another pox virus 

vector, recombinant polio and other enteric virus 
vector, Salmonella species bacterial vector, 
Shigella species bacterial vector, Venezuelean 
Equine Encephalitis Virus (VEE) vector, a Semliki 
15 Forest Virus vector, or a Tobacco Mosaic Virus 
vector. The encoding sequence, can also be 
expressed as a DNA plasmid with, for example, an 
active promoter such as a CMV promoter. Other live 
vectors can also be used to express the sequences of 

20 -the -invention Express ion ._o.f -the. .immunogen of ^the 

invention can be induced in a patient's own cells, 
by introduction into those cells of nucleic acids 
that encode the immunogen, preferably using codons 
and promoters that optimize expression in human 
25 cells. Examples of methods of making and using DKTA 
vaccines are disclosed in U.S. Pat. Nos . 5,580,859, 
5,589,466, and 5,703,055. 

The composition of the invention comprises an 
immunologically effective amount of the immunogen of 
30 this invention, or nucleic acid sequence encoding 
same, in a pharmaceutic ally acceptable delivery 
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system. The compositions can be used for prevention 
and/or treatment of immunodeficiency virus 
infection. The compositions of the invention can be 
formulated using adjuvants, emulsifiers, 

5 pharmaceutically-acceptable carriers or other 
ingredients routinely provided in vaccine 
compositions. Optimum formulations can be readily 
designed by one of ordinary skill in the art and can 
include formulations for immediate release and/or 

10 for sustained release, and for induction of systemic 
immunity and/or induction of localized mucosal 
immunity (e.g, the formulation can be designed for 
intranasal administration) . The present 
compositions can be administered by any convenient 

15 route including subcutaneous, intranasal, oral, 

intramuscular, or other parenteral or enteral route. 
The immunogens can be administered as a single dose 
or multiple doses. Optimum immunization schedules 
can be readily determined by the ordinarily skilled 

20 artisan and can vary with the patient, the 
composition and the effect sought. 

The invention contemplates the direct use of 
both the immunogen of the invention and/or nucleic 
acids encoding same and/or the immunogen expressed. 

25 as minigenes in the vectors indicated above. For 
example, a minigene encoding the immunogen can be 
used as a prime and/or boost. 

The invention includes any and all amino acid 
sequences disclosed herein and, where applicable, CF 

30 and CFI forms thereof, as well as nucleic acid 
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sequences encoding same (and nucleic acids 
complementary to such encoding sequences) . 

Certain aspects of the invention can be 
described in greater detail in the non-limiting 
5 Examples that follows . 

EXAMPLE 1 

Artificial HIV-1 Group M Consensus Envelope 
EXPERIMENTAL DETAILS 

10 Expression of CON6 gp!2 0 and gpl4 0 proteins in 

recombinant vaccinia, viruses (W) . To express and 
. purify the secreted form of HIV-1 CON6 envelope 
proteins, CON6 gpl2 0 and gpl4 0CF plasmids were 
constructed by introducing stop codons after the 

15 gpl2 0 cleavage site (REKR) and before the 

transmembrane domain (YIKIFIMIVGGLIGLRIVFAVLSIVN) , . 
respectively. The gpl2 0/gp41 cleavage site and 
fusion domain of gp41" were deleted in the gpl40CF 
protein. Both CON6 gp!2 0 and gpl4 0CF DNA constructs 

20 were cloned into the pSCG5 vector (from Bernard 
Moss, NIH, Bethesda, MD) at Sail and Kpnl 
restriction enzyme sites. This vector contains the 
lacZ gene that is controlled by the p7 . 5 promoter. 
A back-to-back P E/L promoter was used to express 

25 CON6 env genes. BSC-1 cells were seeded at 2 x 10 5 
in each well in a 6-well plate, infected with wild- 
type vaccinia virus (WR) at a MOI of 0.1 pfu/cell, 
and 2 hr after infection, pSC65 -derived plasmids 
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containing CON6 env genes were transfected into thie 
W-infected cells and recombinant (r) W selected as 
described (Moss and Earl, Current Protocols in 
Molecular Biology, eds, Ausubel et al (John Wiley & 
5 Sons, Inc. . Indianapolis, IN). pp. 16.15.1-16.19.9 

(1998)). Recombinant W that contained the CON6 env 
genes were confirmed by PGR and sequencing analysis. 
•Expression of the CON6 envelope proteins was 
confirmed by SDS-PAGE and Western blot assay. 

10 Recombinant C0N6 gpl2 0 and gpl4 0CF were purified 

with agarose galanthus Nivalis lectin beads (Vector 
Labs, Burlingame, CA) , and stored at -70°C until use. 
Recombinant W expressing JRFL (vCB-28) or 96ZM651 
(vT241R) gpl60 were obtained from the NIH AIDS 

is Research and Reference Reagent Program (Bethesda, 
MD) . 

Monoclonal Antibodies and gp!20 Wild-type 
Envelopes . Human mabs against a conformational 
20 determinant on gpl20 (A32) , the gpl20 V3 loop (F39F) 
and the CCR5 binding site (17b) were the gifts of 
James Robinson (Tulane Medical School, New Orleans, 
LA) (Wyatt et al , Nature 393;705-711 (1998), Wyatt 
et al, J. Virol. 69:5723-5733 (1995)). Mabs 2F5, 
25 447, bl2, 2G12 and soluable CD4 were obtained from 
the NIH AIDS Research and Reference Reagent Program 
(Bethesda, MD) (Gorny et al, J . Immunol. 159:5114- 
5122 (1997), Nyambi et al, J. Virol. 70:6235-6243 
(1996), Purtscher et al, AIDS Res. Hum. Retroviruses 
30 10:1651-1658 (1994), Trkola et al , J . Virol 70:1100- 
1108 (1996)) . T8 is a murine mab that maps to the 
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■ gpl20 CI region (a gift from P. Earl, NIH, Bethesda, 
MD) . BaL (subtype B) , 96ZM651 (subtype C) , and 
93TH975 (subtype E) gpl20s were provided by QBI , 
Inc. and the Division of AIDS, NIH. CHO cell lines 
5 that express 92U037 (subtype A) and 93BR029 (subtype 
F) gpl40 (secreted and uncleaved) were obtained from 
NICBS, England. 

Surface Plasmon Resonance 'Biosensor (SPR) 

10 Measurements and ELISA . SPR biosensor measurements 
were determined on a BIAcore 3 000 instrument 
(BIAcore Inc., Uppsala, Sweden) instrument and data 
analysis was performed using BIAevaluat ion 3.0 
software (BIAcore Inc, Upsaala, Sweden) . Anti-gpl2 0 

15 mabs (T8, A32, 17b, 2G12) or sCD4 in lOmM Na-acetate 
buffer, pH 4 . 5 were directly immobilized to a CM5 
sensor chip using a standard amine coupling protocol 
for protein immobilization. FPLC purified CON6 
gpl2 0 monomer or gpl4 0CF oligomer recombinant 

.20 proteins were flowed over CM5 sensor chips at 

concentrations of 100 and 3 00 /zg/ml , respectively. 
A blank in-line reference surface (activated and de- 
activated for amine coupling) or non-bonding mab 
controls were used to subtract non-specific or bulk 

25 responses. Soluble 89.6 gpl20 and irrelevant IgG 
was used as a positive and negative control 
respectively and to ensure activity of each mab 
surface prior to injecting the CONG Env proteins. 
Binding of CONG envelope proteins was monitored in 

30 real-time at 25°C with a continuous flow of PBS (150 
mM NaCl, 0.005% surfactant P20) , pH 7.4 at 10-30 
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/il/min. Bound proteins were removed and the sensor 
surfaces were regenerated following each cycle of 
binding -by single or duplicate 5-10 \il pulses of 
regeneration solution (10 mM glycine-HCl, pH 2.9). 
5 EL1SA was performed to determine the reactivity of 
various mabs to COM6 gpl2 0 and gpl4 0CF proteins as 
described (Haynes et al, AIDS Res. Hum. Retroviruses 
11:211-221 (1995)). For assay of human mab binding 
to rgpl2 0 or gpl4 0 proteins, end-point titers were 
10 defined as the highest titer of mab (beginning at 2 0 
ptg/ml) at which the mab bound CON6 gpl2 0 and gpl4 0CF 
Env proteins > 3 fold over background control (non- 
binding human mab) . 

15 Infective ty and coreceptor usage assays. HIV- 

l/SG3Aenv and CON6 or control env plasmids were 
cotransfected into human 293T cells. Pseudotyped 
viruses were harvested, filtered and p24 
concentration was quantitated (DuPont/NEN Life 

20 Sciences, Boston, MA) . Equal amounts of p24 (5 ng> 
for each pseudovirion were used to infect JC53-BL 
cells to determine the infectivity (Derdeyn e al , J. 
Virol. 74:8358-8367 (2000), Wei et al , Antimicrob 
Agents Chemother. 46:1896-1905 (2002)). JC53-BL 

25 cells express CD4 , CCR5 and CXCR4 receptors and 
contain a P-galactosidase (p-gal) gene stably 
integrated under the transcriptional control of an 
HIV-1 long terminal repeat (LTR) . These cells can 
be used to quantify the infectious titers of 

30 pseudovirion stocks by staining for p-gal expression 
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and counting the number of blue cells (infectious 
units) per microgram of p24 of pseudovirons (IU//xg 
p24) (Derdeyn e al , J. Virol. 74:8358-8367 (2000), 
Wei et al, Antimicrob Agents Chemother. 46:1896-19 05 
(2002)). To determine the coreceptor usage of the 
C0N6 env gene, JC53BL cells were treated with 1.2 fiM 
AMD3100 and 4 /iM TAK-799 for 1 hr at 37°C then 
infected with equal amounts of p24 (5 ng) of each 
Env pseudotyped virus. The blockage efficiency was 
expressed as the percentage of the infectious -units 
from blockage experiments compared to that from 
control culture without' blocking agents. The 
infectivity from control group (no blocking agent) 
was arbitrarily set as 100%. 

Immuni zations . All animals were housed in the 
Duke University Animal Facility under AALAC 
guidelines with animal use protocols approved by the 
Duke University Animal Use and Care Committee. 

o Recombinant CON6- gpl2 0 and gpl4 0CF -glycoproteins 

were formulated in a stable emulsion with RIBI-CWS 
adjuvant based on the protocol provided by the 
manufacturer (Sigma Chemical Co., St. Louis, MO). 
For induction of anti-envelope antibodies, each of 
5 four out-bred guinea pigs (Harlan Sprague, Inc., 
Chicago, IL) was given 100 \ig either purified CONG 
gpl20 or gp!4 0CF subcutaneous ly every 3 weeks (total 
of 5 immunizations) . Serum samples were heat- 
inactivated (56°C, 1 hr) , and stored at -20°C until 
o use . 
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For induction of anti-envelope T cell 
responses, 6-8 wk old female BALB/c mice (Frederick 
Cancer Research and Developmental Center, NCI, 
Frederick, MD) were immunized i.m. in the quadriceps 
5 with 50 fig plasmid DNA three times at a 3-week 
interval . Three weeks after the last DNA 
immunization, mice were boosted with 10 7 PFU of rW 
expressing Env proteins. Two weeks after the boost, 
all mice were euthanized and spleens were removed 
10 for isolation of splenocytes. 

Neutralization assays. Neutralization assays 
were performed using either a MT-2 assay as 
described in Bures et al , AIDS Res . Hum. 

15 Retroviruses 16:2019-2035 (2000), a lucif erase-based 
multiple replication cycle HIV-1 infectivity assay 
in 5 . 25 . GFP.Luc .M7 cells using a panel of HIV-1 
primary isolates (Bures et al , AIDS Res. Hum. 
Retroviruses 16:2019-2035 (2000), Bures et al , J. 

20 Virol. 76:2233-2244 (2002>), or a syncytium (fusion 
from without) inhibition assay using inactivated 
HIV-1 virions (Rossio et al , J. Virol. 72:7992-8001 
(1998) ) . In the lucif erase-based assay, 
neutralizing antibodies were measured as a function 

25 of a reduction in luciferase acitivity in 

5 .25 .EGFP.Luc.M7 cells provided by Nathaniel R. 
Landau, Salk Institute, La Jolla, CA (Brandt et al , 
J. Biol. Chem. 277:17291-17299 (2002)). Five 
hundred tissue culture infectious dose 50 (TCID 50 ) of 

30 cell -free virus was incubated with indicated serum 



62 



WO 2005/028625 



PCT/US20O4/030397 



dilutions in 150 fil (1 hr, at 37°C) in triplicate in 
96-well flat-bottom culture plates. The 
5 .25 .EGFP.Luc.M7 cells were suspended at a density 
of 5 x 10 5 /ml in media containing DEAE dextran (10 
5 jag/ml) . Cells (100 |^1) were added and until 10% of 
cells in control wells (no test serum sample) were 
positive for GFP expression by fluorescence 
microscopy. At this time the cells were 
concentrated 2- fold by removing one -half volume of 

10 media. A 50 \xl suspension of cells was transferred 
to 96-well white solid plates (Costar, Cambridge, 
MA) for measurement of luciferase activity using 
Bright-Glo™ substrate (Promega, Madison, WI) on a 
Wallac 1420 Multilabel Counter (PerkinElmer Life 

15 Sciences, Boston, MA) . Neutralization titers in the 
MT-2 and luciferase assays were those where _> 50% 
virus infection was inhibited. Only values that 
titered beyond 1:20 (i.e. >1:30) were considered 
significantly positive. The syncytium inhibition 

20 "fusion from without" assay utilized HIV-1" 

aldrithiol-2 (AT-2) inactivated virions from HIV-1 
subtype B strains ADA and AD8 (the gift of Larry 
Arthur and Jeffrey Lifson, Frederick Research Cancer 
Facility, Frederick, MD) added to SupTl cells, with 

25 syncytium inhibition titers determined as those 
titers where _>90% of syncytia were inhibited 
compared to prebleed sera. 

Enzyme linked immune spot (ELISPOT) assay. 
30 Single-cell suspensions of splenocytes from 



63 



WO 2005/028625 PCT/US2O04/030397 



individual immunized mice were prepared by mincing 
and forcing through a 70 |im Nylon cell strainer (BD 
Labware, Franklin Lakes, NJ) . Overlapping Env 
peptides of CON6 gpl40 (159 peptides, 15mers 
5 overlapping by n) were purchased from Boston 

Bioscence, Inc (Royal Oak, MI) . Overlapping Env 
peptides of MN gpl40 (subtype B; 170 peptides, 
15mers overlapping by 11) and Chnl9 gpl40 (subtype 
C; 69 peptides, 2 0mers overlapping by 10) were 

10 obtained from the NIH AIDS Research and Reference 
Reagent Program (Bethesda, MD) . Splenocytes (5 
mice/group) from each mouse were stimulated in vitro 
with overlapping Env peptides pools from CON6 , 
subtype B and subtype C Env proteins. 96-well PVDF 

15 plates (Multiscreen- IP, Millipore, Billerica, MA) 
were coated with anti-IFN-y mab (5 |J.g/ml , AN18; 
Mabtech, Stockholm, Sweden) . After the plates were 
blocked at 37" C for 2 hr using complete Hepes 
buffered RPMI medium, 50|J,1 of the pooled overlapping 

20 envelope peptides (13 CON6 and MN pools, 13-14 

peptides in each pool; 9 Chnl9 pool, 7-8 peptide in 
each pool) at a final concentration of 5 /xg/ml of 
each were added to the plate. Then 50 //l of 
splenocytes at a concentration of 1.0 X 10 7 /ml were 

25 added to the wells in duplicate and incubated for 16 
hr at 3 7 "C with 5% C0 2 . The plates were incubated 
with 100 of a 1:1000 dilution of streptavidin 
alkaline phosphatase (Mabtech, Stockholm, Sweden) , 
and purple spots developed using 100 /il of BCIP/NBT 

30 (Plus) Alkaline Phosphatase Substrate (Moss, 
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Pasadena, MD) . Spot forming cells (SFC) were 
measured using an Immunospot counting system (CTL 
Analyzers, Cleveland, OH). Total responses for each 
envelope peptide pool are expressed as SFCs per 10 6 
5 splenocytes. 

RESULTS 

CONS Envelope Gene Design,. Construction and 

10 Expression. An artificial group M consensus env 

gene (C0N6) was constructed by generating consensus 
sequences of env genes for each HIV-1 subtype from 
sequences in the Los Alamos HIV Sequence Database, 
and then generating a consensus sequence of all 

is subtype consensuses to avoid heavily sequenced 
subtypes (Gaschen et al , Science 296:2354-2360 
(2002), Korber et al , Science 288:1789-1796 (2000)). 
Five highly variable regions from a CRF0 8_BC 
recombinant strain (98CN0 06) (VI, V2 , V4 , V5 and a 

20 region in cytoplasmic domain of gp41) were then used 
to fill in the missing regions in CON6 sequence. 
The CON6 V3 region is group M consensus (Figure 1A) . 
For high levels of expression, the codons of C0N6 
env gene were optimized based on codon usage for 

25 highly expressed human genes (Haas et al, Curr. 
Biol. 6:315-324 (2000), Andre et al , J. Virol. 
72:1497-1503 (1998)). (See Fig. ID.) The codon 
optimized CON6 env gene was constructed and 
subcloned into pcDNA3 . 1 DNA at EcoR I and BamH I 

30 sites (Gao et al , AIDS Res. Hum. Retroviruses, 
19:817-823 (2003)). High levels of protein 
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expression were confirmed with Western-blot assays 
after transfection into 293T cells. To obtain 
recombinant CON 6 Env proteins for characterization 
and use as immunogens , rW was generated to express 
5 secreted gpl2 0 and uncleaved gpl4 0CF (Figure IB) . 
Purity for each protein was ,>9 0% as determined by 
Coomassie blue gels under reducing conditions 
(Figure 1C) . 

10 CD4 Binding Domain and Other Wild- type HIV-1 

Epitopes are Preserved on CON6 Proteins. To 
determine if CON6 proteins can bind to CD4 and 
express other wild-type HIV-1 epitopes, the ability 
of CONG gpl2 0 and gpl4 0CF to bind soluble (s) CD4 , to 

is bind several well-characterized anti-gpl20 mabs, and 
to undergo CD4- induced conformational changes was 
assayed. Fii~st, BIAcore CMS sensor chips were 
coated with either sCD4 or mabs to monitor their 
binding activity to CON6 Env proteins. It was found 

20 that both monomeric CON6 gpl2 0 and oligomeric 

gpl40CF efficiently bound sCD4 and anti-gpl20 mabs 
T8, 2G12 and A32, but did not constitutively bind 
mab 17b, that recognizes a CD4 inducible epitope in 
the CCR5 binding site of gp!2 0 (Figures 2A and 2B) 

2 5 Both sCD4 and A3 2 can expose the 17b binding epitope 
after binding to wild- type gpl2 0 (Wyatt et al , 
Nature 393/705-711 (1998), Wyatt et al , J. Virol. 
69:5723-5733 (1995)). To determine if the 17b 
epitope could be induced on CON6 Envs by either sCD4 

30 or A32, sCD4, A32 and T8 were coated on sensor 

chips, then CON6 gpl2 0 or gpl4 0CF captured, and mab 
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17b binding activity monitored. After binding sCD4 
or mab A32, both CON6 gpl20 and gp!40CF were 
triggered to undergo conformational changes and 
bound mab 17b (Figures 2C and 2D) . In contrast, 
5 after binding mab T8 , the 17b epitope was not 

exposed (Figures 2C and 2D) . ELISA was next used to 
determine the reactivity of a panel of human mabs 
against the gpl20 V3 loop (447, F39F) , the CD 4 
binding site (bl2) , and the gp41 neutralizing 

10 determinant (2F5) to CON 6 gpl2 0 and gpl4 0CF (Figure 
2E) . Both CON6 rgpl2 0 and rgpl4 0CF proteins bound 
well to neutralizing V3 mabs 44 7 and F3 9F and to the 
potent neutralizing CD4 binding site mab bl2 . Mab 
2F5, that neutralizes HIV-1 primary isolates by 

15 binding to a C- terminal gp41 epitope, also bound 
well to CON6 gpl40CF (Figure 2E) . 

CON6 env Gene is Biologically Functional and 
Uses CCR5 as its Coreceptor. To determine whether 

20- - CON6. envelope _ gene.. is., biologically _ functional , _it_ 

was co-transf ected with the env- defective SG3 
proviral clone into 293T cells. The pseudotyped 
viruses were harvested and JC53BL cells infected. 
Blue cells were detected in JC53-BL cells infected 
25 with the CONS Env pseudovirions , suggesting that 

CON6 Env protein is biologically functional (Figure 
3A) . However, the infectious titers were 1-2 logs 
lower than that of pseudovirions with either YU2 or 
NL4-3 wild-type HIV-1 envelopes. 
30 The co- receptor usage for the CON 6 env gene was 

next determined. When treated with CXCR4 blocking 
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agent AMD3100, the infect ivity of NL4-3 Env- 
pseudovirons was blocked while the infectivity of 
YU2 or CON6 Env-pseudovirons was not inhibited 
(Figure 3B) . In contrast, when treated with CCR5 
5 blocking agent TAK-779, the infectivity of NL4-3 
Env-pseudovirons was not affected, while the 
infectivity of YU2 or CON6 Env-pseudovirons was 
inhibited. When treated with both blocking agents, 
the infectivity of all pseudovirions was inhibited. 
10 Taken together, these data show that the CONG 

envelope uses the CCR5 co-receptor for its entry 
into target cells. 

-Reaction of CON6 gpl20 With Different Subtype? 

15 Sera. To determine if multiple subtype linear 

epitopes are preserved on CON6 gpl2 0, a recombinant 
Env protein panel (gpl2 0 and gpl4 0) was generated. 
Equal amounts of each Env protein (10 0 ng) were 
loaded on SDS-polyacrylamide gels, transferred to 

20 nitrocellulose, and reacted with subtype A through G 
patient sera as well as anti-CON6 gpl20 guinea pig 
sera (1:1,000 dilution) in Western blot assays. For 
each HIV-1 subtype, four to six patient sera were 
tested. One serum representative for each subtype 

2 5 is shown in Figure 4 . 

It was found that whereas all subtype sera 
tested showed variable reactivities among Envs in 
the panel, all group M subtype patient sera reacted 
equally well with CONS gp!2 0 Env protein, 

30 demonstrating that wild-type HIV-l Env epitopes 

recognized by patient sera were well preserved on 
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the CON6 Env protein. A test was next made as to 
whether CON6 gpl2 0 antiserum raised in guinea pigs 
could react to different subtype Env proteins. It 
was found that the CONS serum reacted to its own and 
5 other subtype Env proteins equally well, with the 
exception of subtype A Env protein (Figure 4) . 

Induction of T Cell Responses to CON6 , Subtype 
B and Subtype C Envelope Overlapping Peptides . To 

io, compare T cell immune responses induced by CON6 Env 
immunogens with those induced by subtype specific 
immunogens, two additional groups of mice were 
immunized with subtype B or subtype C DNAs and with 
corresponding rW expressing subtype B or C envelope 

15 proteins. Mice immunized with subtype B (JRFL) or 
subtype C (96ZM651) Env immunogen had primarily 
subtype -specific T cell immune responses (Figure 5) . 
IFN-y SFCs from mice immunized with JRFL (subtype B) 
immunogen were detected after stimulation with 

20 subtype B (MN) peptide pools, but not with- either 

subtype C (Chnl9) or CON6 peptide pools. IFN-y SFCs 
from mice immunized with 96ZM651 (subtype C) 
immunogen were detected after the stimulation with 
both subtype C (Chnl9) and CON6 peptide .pools, but 

25 not with subtype B (MN) peptide pools. In contrast, 
IFN-y SFCs were identified from mice immunized with 
CON6 Env immunogens when stimulated with either CON6 
peptide pools as well as by subtype B or C peptide 
pools (Figure 5) . The T cell "immune responses 

30 induced by CONG gpl4 0 appeared more robust than 
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those induced by CON6 gpl20. Taken together, these 
data demonstrated that CON6 gpl2 0 and gpl4 0CF 
immunogens were capable of inducing Tcell responses 
that recognized T cell epitopes of wild-type subtype 
5 B and C envelopes . 

Induction of Antibodies by Recombinant CONG 
gp!20 and gp!40CF Envelopes that Neutralize HIV-l 
Subtype B and C Primary Isolates. To determine if 
10 the CON6 envelope immunogens can induce antibodies 
that neutralize HIV-l primary isolates, guinea pigs 
were immunized with either C0N6 gpl2 0 or gpl4 0CF 
protein. Sera collected after 4 or 5 immunizations 
were used for neutralization assays and compared to 
15 the corresponding prebleed sera. Two AT -2 

inactivated HIV-l isolates (ADA and AD8 ) were tested 
in syncytium inhibition assays (Table 5A) . Two 
subtype B SHIV isolates, eight subtype B primary 
isolates, four subtype C, and one each subtype A, D, 
20 and E primary isolates were tested in either the MT- 
2 or the lucif erase-based assay (Table 5B) . In the 
syncytium inhibition assay, it was found that 
antibodies induced by both CON 6 gpl2 0 and gpl40CF 
proteins strongly inhibited AT- 2 inactivated ADA and 
25 AD8-induced syncytia (Table 5A) . In the MT-2 assay, 
weak neutralization of 1 of 2 SHIV isolates (SHIV 
SF162P3) by two gpl2 0 and one gpl4 0CF sera was found 
(Table 5B) . In the lucif erase-based assay, strong 
neutralization of 4 of 8 subtype B primary isolates 
30 (BX08, SF162, SS1196, and BAL) by all gpl20 and 

gpl4 0CF sera was found, and weak neutralization of 2 
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of 8 subtype B isolates (6101, 0692) by most gpl20 
and gp!40CF sera was found. No neutralization was 
detected against HIV-1 PAVO (Table 5B) . Next, the 
CON6 anti-gpl2 0 and gpl4 0CF sera were tested against 
5 four subtype C HIV-1 isolates, and weak 

neutralization of 3 of 4 isolates (DU179, DU368, and 
S080) was found, primarily by anti-CON6 gpl20 sera. 
One gpl40CF serum, no. 653, strongly neutralized 
DU179 and weakly neutralized S080 (Table 5B) . 
10 Finally, anti-CON6 Env sera strongly neutralized a 
subtype D isolate (93ZR001) , weakly neutralized a 
subtype E (CM244) isolate, and did not neutralize a 
subtype A (92RW020) isolate. 

Table 5A 



Ability of HIV-1 Group M Consensus Envelope CON6 Proteins to Induce 
Fusion inhibiting Antibodies 







Syncytium Inhibition antibody titer 1 


Guinea Pig No. 


Immunogen 


AD8 


ADA 


646 


gpi20 


270 


270 


647 


gpi20 


90 


90 


648 


gpi20 


90 


) 270 


649 


gpi20 


90 


- 90- - - 


Geometric Mean Titer 




119 


156 


650 


gpi40 


270 


270 


651 


gp140 


90 


90 


652 


gpi40 


SB10 


810 


653 


gpi40 


270 


90 


Geometric Mean Titer 




270 


207 



1 Reciprocal serum dilution at which HIV-induced syncytia of Sup T1 cells was 
inhibited by >90% compared to pre-immune serum. All prebleed sera were negative 
(titer <10). 
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CONCLUSIONS 

The production of an artificial HIV-l Group M 
consensus env genes (encoding sequences) (CON6 and 
5 . Con-S) have been described that encodes a functional 
Env protein that is capable of utilizing the CCR5 
co-receptor for mediating viral entry. Importantly, 
these Group M consensus envelope genes could induce 
T and B cell responses that recognized epitopes of 
10 subtype B and C HIV-l primary isolates. In 

addition, Con-S induces antibodies that strongly 
neutralize Subtype-C and A HIV-l strains (see 
Table 3) . 

The correlates of protection to HIV-l are not 
15 conclusively known. Considerable data from animal 
models and studies in HIV-l-inf ected patients 
suggest, the goal of HIV-l vaccine development should 
be the induction of broadly- reactive CD4+ and CD8 + 
■ ant i -HIV-l T cell responses (Letvin et al , Annu. 
20 Rev. Immunol. 20:73-99 (2002)) and high _ levels of 

antibodies that neutralize HIV-l primary isolates of 
multiple subtypes (Mascola et al, J. Virol. 73:4009- 
4018 (1999), Mascola et al , Nat. Med. 6:270-210 
(2000) ) . 

25 The high level of genetic variability of HIV-l 

has made it difficult to design immunogens capable 
of inducing immune responses of sufficient breadth 
to be clinically useful. Epitope based vaccines for 
T and B cell responses (McMichael et al , Vaccine 

30 20:1918-1921 (2002), Sbai et al , Curr. Drug Targets 
Infect, Disord. 1:303-313 (2001), Haynes, Lancet 
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348:933-937 (i 9 96)), constrained envelopes 
reflective of fusion intermediates (Fouts et al , 
Proc. Natl. Acad. Sci . USA 99:11842-22847 (2002)), 
as well as exposure of conserved high-order 
5 structures for induction of ant i -HIV- 1 neutralizing 
antibodies have been proposed to overcome HIV-1 
variability (Roben et al , J. Virol. 68:4821-4828 
(1994), Saphire et al , Science 293:1155-1159 
(2001)). However, with the ever- increasing 
10 diversity and rapid evolution of HIV-1, the virus is 
a rapidly moving complex target, and the extent of 
complexity of HIV-1 variation makes all of these 
approaches problematic. The current most common 
approach to HIV-1 immunogen design is to choose a 
15 wild- type field HIV-1 isolate that may or may not be 
from the region in which the vaccine is to be 
tested. Polyvalent envelope immunogens have been 
designed incorporating multiple envelope immunogens 
(Bartlett et al , AIDS 12:1291-1300 (1998), Cho et 
o al, J. Virol. 75:2224-2234 (2001)). 

The above-described study tests a new strategy 
for HIV-1 immunogen design by generating a group M 
consensus env gene (CON6) with decreased genetic 
distance between this candidate immunogen and wild- 
5 type field virus strains. The CONG env gene was 
generated for all subtypes by choosing the most 
common amino acids at most positions (Gaschen et al , 
Science 296:2354-2360 (2002), Korber et al , Science 
288:1789-1796 (2000)). Since only the most common 
amino acids were used, the majority of antibody and 
T cell epitopes were well preserved. Importantly, 
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the genetic distances between the group M consensus 
env- sequence and any subtype env sequences was about 
15%, which is only half of that between wild- type 
subtypes (30%) (Gaschen et al , Science 296:2354-23 60 
5 (2002)). This distance is approximately the same as 
that among viruses within the same subtype. 
Further, the group M consensus env gene was also 
about 15% divergent from any recombinant viral env 
gene, as well, since CRFs do not increase the 

10 overall genetic divergence among subtypes. 

Infect ivity of CON6-Env pseudovirions was 
confirmed using a single-round infection system, 
although the infectivity was compromised, indicating 
the artificial envelope was not in an "optimal" 

15 functional conformation, but yet was able to mediate 
virus entry. That the CON6 envelope used CCR5 (R5) 
as its coreceptor is important, since majority of 
HIV-1 infected patients are initially infected with 
R5 viruses. 

20 BIAcore analysis showed that both CON6 gpl2 0 

and gpl4 0CF bound sCD4 and a number of mabs that 
bind to wild- type HIV-1 Env proteins. The 
expression of the CONG gpl2 0 and 14 0CF proteins that 
are similar antigenically to wild- type HIV-1 

25 envelopes is an important step in HIV-1 immunogen 
development. However, many wild-type envelope 
proteins express the epitopes to which potent 
neutralizing human mabs bind, yet when used as 
immunogens themselves, do not induce broadly 

30 neutralizing anti -HIV-1 antibodies of the 

specificity of the neutralizing human mabs. 
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The neutralizing antibody studies were 
encouraging in that both CON 6 gpl2 0, CON 6 gpl4 0CF 
and Con-S gpl4 0CFI induced antibodies that 
neutralized select subtype B, C and D HIV-1 primary 
isolates, with Con-S gpl40CFI inducing the most 
robust neutralization of non- subtype B primary HIV 
isolates. However, it is clear that the most 
dif f icult-to-neutralize primary isolates (PAVO, 
6101, BG1168, 92RW020, CM244) were either only 
weakly or not neutralized by ant i- CON 6 gpl2 0 or 
gpl4 0 .sera (Table 4b) . Nonetheless, the Con-S 
envelope immunogenicity for induction of 
neutralizing antibodies is promising, given the 
breadth of responses generated with the Con-S 
subunit gpl40CFI envelope protein for non-subtype B 
HIV isolates. Previous studies with poxvirus 
constructs expressing gpl2 0 and gplGO have not 
generated high levels of neutralizing antibodies 
(Evans et al , J. Infect. Dis . 180:290-298 (1999), 
Polacino et al, J. Virol. 73:618-630 (1999), 
Ourmanov et al , J. Virol. 74:2960-2965 (2000), Pal 
et al, J. Virol 76:292-302 (2002), Excler and 
Plotkin, AIDS 11 (Suppl A):S127-137 (1997). rW 
expressing secreted CON6 gpl2 0 and gpl4 0 have been 
constructed and antibodies that neutralize HIV-1 
primary isolates induced. An HIV neutralizing 
antibody immunogen can be a combination of Con-S 
gpl4 0CFI, or subunit thereof, with immunogens that 
neutralize most subtype B isolates. 
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The structure of an oligomeric gpl4 0 protein is 
critical when evaluating protein immunogenicity . In 
this regard, study of purified CON6 gpl40CF proteins 
by fast performance liquid chromatography (FPLC) and 
analytical. ultracentrif iguration has demonstrated 
that the purified gpl40 peak consists predominantly 
of trimers with a small component of dimers . 

Thus, centralized envelopes such as CON6 , Con-S 
or 2 0 03 group M or subtype consensus or ancestral 
encoding sequences described herein, are attractive 
candidates for preparation of various potentially 
"enhanced" envelope immunogens including CD4-Env 
complexes, constrained envelope structures , and 
trimeric oligomeric forms.. The ability of CON6- 
induced T and B cell responses to protect against 
HIV-1 infection and/or disease in SHIV challenge 
models will be studied in non-human primates. 
The above study has demonstrated that 
o artificial centralized HIV-1 genes such as group M 
consensus env gene (CONG) and Con-S can also induce 
T cell responses to T cell epitopes in wild-type 
subtype B and C Env proteins as well as to those on 
group M consensus Env proteins (Figure 5} . While 
5 the DNA prime and rW boost regimen with CON6 

gpl4 0CF immunogen clearly induced IFN-y producing T 
cells that recognized subtype B and C epitopes, 
further studies are needed to determine if 
centralized sequences such as are f ound in the CON 6 
o envelope are significantly better at inducing cross- 
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clade T cell responses than wild-type HIV-1 genes 
(Ferrari et al , Proc. Natl. Acad. Sci . USA 94:1396- 
1401 (1997), Ferrari et al, AIDS Res. Hum. 
Retroviruses 16:1433-1443 (2000)). However, the 
5 fact that CON6 (and Con-S, env encoding sequence) 

prime and boosted splenocyte T cells recognized HIV- 
1 subtype B and C T cell epitopes is an important 
step in demonstration that CON6 (and Con-S) can 
induce T cell responses that might be clinically 
10 useful . 

Three computer models (consensus, ancestor and 
center of the tree (COT) ) have been proposed to 
generate centralized HIV-1 genes (Gaschen et al , 
Science 296:23 54-2360 (2002), Gao et al , Science 
15 299:1517-1518 (2003), Nickle et al, Science 
299:1515-1517 (2003), Korber et al , Science 
288:1789-1796 (2000) . They all tend to locate at 
the roots of the star-like phylogenetic trees for 
most HIV-1 sequences within or between subtypes. As 
20 experimental vaccines, they all can reduce the 

genetic distances between immunogens and field virus 
strains. However, consensus, ancestral and COT 
sequences each have advantages and disadvantages 
(Gaschen et al , Science 296:2354-2360 (2002), Gao et 
25 al, Science 299:1517-1518 (2003), Nickle et al , 
Science 2 99:1515-1517 (2003). Consensus and COT 
represent the sequences or epitopes in sampled 
current wild-type viruses and are less affected by 
outliers HIV-l sequences, while ancestor represents 
ancestral sequences that can be significantly 
affected by outlier sequences. However, at present, 
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it is not known which centralized sequence can serve 
as the best immunogen to elicit broad immune 
responses against diverse HIV-1 strains, and studies 
are in progress to test these different strategies . 
5 Taken together, the data have shown that the 

HIV-1 artificial CON6 and Con-S envelope can induce 
T cell responses to wild- type HIV-1 epitopes, and 
can induce antibodies that neutralize HIV-1 primary 
isolates, thus demonstrating the feasibility and 
10 promise of using artificial centralised HIV-1 
sequences in HIV-1 vaccine design. 

EXAMPLE 2 

HIV-1 Subtype C Ancestral and Consensus Envelope 
15 Glycoproteins 

EXPERIMENTAL DETAILS ' 

HIV-1 subtype C ancestral and consensus env 
genes were obtained from the Los Alamos HIV 
Molecular Immunology Database (http://hiv- 

20 web.lanl.gov/immunology), codon-usage optimized for 
mammalian cell expression, and synthesized (Fig. 6). 
To ensure optimal expression, a Kozak sequence 
(GCCGCCGCC) was inserted immediately upstream of the 
initiation codon. In addition to the full-length 

25 genes, two truncated env' genes were generated by 

introducing stop codons immediately after the gp41 
membrane -spanning domain (IVNR) and the gpl20/gp41 
" cleavage site (REKR) , generating gpl4 0 and gpl2 0 
form of the glycoproteins, respectively (Fig. 8) . 



79 



WO 2005/028625 PCT/US2O04/030397 



10 



15 



20 



25 



30 



Genes were tested for integrity in an in vitro 
transcription/translation system and expressed in 
mammalian cells. To determine if the ancestral and 
consensus subtype C envelopes were capable of 
mediating fusion and entry, gpieo and gp!40 genes 
were co-transf ected with an HIV-l/SG3Aenv provirus 
and the resulting pseudovirions tested for 
infect ivity using the JC53-BL cell assay (Fig. 7). 
Co-receptor usage and envelope neutralization 
sensitivity were also determined with slight 
modifications of the JC53-BL assay. Codon-usage 
optimized and rev -dependent 96ZAM651 env genes were 
used as contemporary subtype C controls. 

RESULTS 

Codon-optimized subtype C ancestral and 
consensus envelope genes {gpl60, gp!40, gpl20) 
express high levels of env glycoprotein in mammalian 
cells (Fig. 9) . - 

Codon-optimized subtype C gpl60 and gpl40 
glycoproteins are efficiently incorporated into 
virus particles. Western Blot analysis of sucrose - 
purified pseudovirions reveals ten-fold higher 
levels of virion incorporation of the codon- 
optimized envelopes compared to that of a rev- 
dependent contemporary envelope controls (Fig. 10A) . 

Virions pseudotyped with either the subtype C 
consensus gpl60 or gpl40 envelope were more 
infectious than pseudovirions containing the 
corresponding gpiso and gpl40 ancestral envelopes. 
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Additionally, gp!60 envelopes were consistently more 
infectious than their respective gpl40 counterparts 
(Fig. . 10B) . 

Both subtype C ancestral and consensus 
- envelopes utilize CCR5 as a co-receptor to mediate 
virus entry (Fig. 11). 

The infectivity of subtype C ancestral and 
consensus gplGO containing pseudovirions was 
neutralised by plasma from subtype C infected 
> patients. This suggests that these artificial 

envelopes possess a structure that is similar to 
that of native HIV-1 env glycoproteins and that 
common neutralization epitopes are conserved. No 
significant differences in neutralization potential 
5 were noted between subtype C ancestral and consensus 
env glycoproteins (gpl6 0) (Fig. 12) . 

CONCLUSIONS 

HIV-1 subtype C viruses are among the most 
- -prevalent- circulating isolates, representing 

o approximately fifty percent of new infections 
worldwide. Genetic diversity among globally 
circulating HIV-1 strains poses a challenge for 
vaccine design. Although HIV-1 Env protein is highly 
variable, it can induce both humoral and cellular 

5 immune responses in the infected host. By analyzing 
7 0 HIV-1 complete subtype C env sequences, consensus 
and ancestral subtype C env genes have been 
generated. Both sequences are roughly equidistant 
from contemporary subtype C strains and thus 
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expected to induce better cross -protective immunity. 
A reconstructed ancestral or consensus sequence 
derived- immunogen minimizes the extent of genetic 
differences between the vaccine candidate and 
5 contemporary isolates. However, consensus and 

ancestral subtype C env genes differ by 5% amino 
acid sequences. Both consensus and ancestral 
sequences have been synthesized for analyses. 
Codon-optimized subtype C ancestral and consensus 

10 envelope genes have been constructed and the in 
vitro biological properties of the expressed 
glycoproteins determined. Synthetic subtype C 
consensus and ancestral ertv genes express 
glycoproteins that are similar in their structure, 

15 function and antigenicity to contemporary subtype C 
wild-type envelope glycoproteins. 



EXAMPLE 3 

Codon-Usage Optimization of Consensus of Subtype C 
20 gag and nef Genes (C. con. gag and C.con.nef) 

Subtype C viruses have become the most 
prevalent viruses among all subtypes of Group M 
viruses in the world. More than 50% of HIV-1 
25 infected people are currently carrying HIV-1 subtype 
C viruses. In addition, there is considerable 
intra-subtype C variability: different subtype C 
viruses can differ by as much as 10%, 6%, 17% and 
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16% of their Gag, Pol, Env and Nef proteins, 
respectively. Most importantly, the subtype C 
viruses from one country can vary as much as the 
viruses isolated from- other parts of the world. The. 

5 only exceptions are HIV-l strains from India/China, 
Brazil and Ethiopia/Djibouti where subtype C appears 
to have been introduced more recently. Due to the 
high genetic variability of subtype C viruses even 
within a single country, an immunogen based on a 

10 single virus isolate may not elicit protective 

immunity against other isolates circulating 'in the 
same area. 

Thus gag and nef gene sequences of subtype C 
viruses were gathered to generate consensus 
15 sequences for both genes by using a 50% consensus 

threshold. To avoid a potential bias toward founder 
viruses, only one sequence was used from 
India/China, Brazil and Ethiopia/Djibouti, 
respectively, to generate the subtype C consensus 
20 -sequences (C. con. gag and C . con. nef ) . The codons of 
both C. con. gag and C. con. nef genes were optimized 
based on the codon usage of highly expressed human 
genes. The protein expression following transfection 
into 293T cells is shown in Figure 13. As can be 
25 seen, both consensus subtype C Gag and Nef proteins 
were expressed efficiently and recognized by Gag- 
and Nef -specific antibodies. The protein expression 
levels of both C. con. gag and C. con. nef genes are 
. comparible to that of native subtype env gene 
30 (96ZM651) . 
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EXAMPLE 4 

Synthesis of a Full Length "Consensus of the 
Consensus env Gene with Consensus Variable Regions" 
5 (CON-S) ' 

In the synthesized w consensus of the consensus" 
env gene (CON6) , the variable regions were replaced 
with the corresponding regions from a contemporary 

10 subtype C virus (98CN006) . A further con/con gene 
has been designed that also has consensus variable 
regions (CON-s) . The codons. of the Con-S env gene 
were optimized based on the codon usage of highly 
expressed human genes. (See Figs. 14A and 14B for 

15 amino acid sequences and nucleic acid sequences, 
respectfully. ) 

Paired oligonucleotides (80-mers) which overlap 
by 20 bp at their 3' ends and contain invariant 
sequences at their 5' and 3' ends, including the 

20 restriction enzyme sites EcoRI and Bbsl as well as 
BsmBI and BamHI , respectively, were designed. Bbsl 
and BamHI are Type II restriction enzymes that 
cleave outside of their recognition sequences. Thiey 
have been positioned in the oligomers in such a way 

25 that they cleave the first four resides adjacent to 
the 18 bp invariant region, leaving 4 base 5' 
overhangs at the end of each fragment for the 
following ligation step. 26 paired oligomers were 
linked individually using PCR and primers 

30 complimentary to the 18 bp invariant sequences. 
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Each pair was cloned into pGEM-T (Promega) using the 
T/A cloning method and sequenced to confirm the 
absence of inadvertent mutations/deletions. pGEM-T 
subclones containing the proper inserts were then 
digested, run on a 1% agarose gel, and gel purified 
(Qiagen) . Four individual 10 8-mers were ligated 
into pcDNA3 . 1 (Invitrogen) in a multi- fragment 
ligation reaction. The four-way ligations occurred 
among groups of fragments in a stepwise manner from 
the 5' to the 3' end of the gene. This process was 
repeated until the entire gene was reconstructed in 
the pcDNA3 . 1 vector. 

A complete Con-S gene was constructed by 
ligating the codon usage optimized oligo pairs 
together. To -confirm its open reading frame, an in' 
vitro transcription and translation assay was 

performed. Protein products were labeled by S 35 - 
methionine during the translation step, separated on 
a 10% SDS-PAGE, and detected by radioautography . 
o Expected size of the expressed Con-S gplGO was 
identified in 4 out of 7. clones (Fig. 14C) . 

CONs Env protein expression in the mammalian 
cells after transfected into 293T cells using a 
Western blot assay (Figure 15) . The expression level 
5 of Con-S Env protein is very similar to what was 
observed from the previous CON6 env clone that 
contains the consensus conservative regions and 
variable loops from 98CN006 virus isolate. 
The Env-pseudovirons was produced by 
o cotransf ecting Con-S env clone and env- deficient SG3 
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proviral clone into 293T cells. Two days after 
transf ection, the pseudovirions were harvested and 
infected into JC53BL-13 cells. The infectious units 
(IU) were determined by counting the blue cells 
5 -after staining with X-gal in three independent 

experiments. When compared with CON6 env clone, Con- 
S env clones produce similar number of IU in JC53BL- 
13 cells (Figure 16) . The IU titers for both are 
about 3 log higher than the SG3 backbone clone 

10 control (No Env) . However, the titers are also 
about 2 log lower than the positive control (the 
native HIV-1 env gene, NL4-3 or YU2) . These data 
suggest that both consensus group M env clones are 
biologically functional. Their functionality, 

15 however, has been compromised. The functional 

consensus env genes indicate that these Env proteins 
fold correctly, preserve the basic conformation of 
the native Env proteins, and are able to be 
developed as universal Env immunogens . 

20 !t w as next determined what coreceptor Con-S 

Env uses for its entry into JC53-BL cells. When 
treated with CXCR4 blocking agent AMD3100, the 
infectivity of NL4-3 Env-pseudovirons was blocked 
while the infectivity of YU2 , Con-S or CON6 Env- 

25 pseudovirons was not inhibited. In contrast, when 
treated with CCR5 blocking agent TAK779, the 
infectivity of NL4-3 Env-pseudovirons was not 
affected, while the infectivity of YU2 , Con-S or 
CONG Env-pseudovirons was inhibited. When treated 

30 with both blocking agents, the infectivity of all 
pseudovirions was inhibited. Taken together, these 
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data show that the Con-S as well as CON6 envelope 
uses the CCR5 but not CXCR4 co-receptor for its 
entry into target cells. 

It was next determined whether CONS or Con-S 
Env proteins could be equally efficiently 
incorporated in to the pseudovirions . To be able 
precisely compare how much Env proteins were 
incorporated into the pseudovirions, each 
pseudovirions is loaded on SDS-PAGE at the same 
concentraion: 5|ig total protein for cell lysate, 
25ng p24 for cell culture supernatant, or 150ng p24 
for purified virus stock (concentrated pseudovirions 
after super-speed centrif ugation) . There was no 
difference in amounts of Env proteins incorporated 
in CON 6 or Con-S Env-pseudovirions in any 
preparations (cell lysate, cell culture supernatant 
or purified virus stock) (Figure 17) . 

EXAMPLE 5 

Synthesis of a Consensus Subtype A Full Length env 

(A. con. env) Gene 

Subtype A viruses are the second most prevalent 
HIV-1 in the African continent where over 70% of 
HIV-1 infections have been documented. Consensus 
ga.g r env and nef genes for subtype C viruses that 
are the most prevalent viruses in Africa and in the 
world were previously generated. Since genetic 
distances between subtype A and C viruses are as 
high as 3 0% in the env gene, the cross reactivity or 
o protection between both subtypes will not be 
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20 



25 



30 



optimal. Two group M consensus env genes for all 
subtypes were also generated. However, to target 
any particular subtype viruses, the subtype specific 
consensus genes will be more effective since the 
genetic distances between subtype consensus genes 
and field viruses from the same subtype will be 
smaller than that between group M consensus genes 
and these same viruses. Therefore, consensus genes 
need to be generated for development of subtype A 
specific immunogens. The codons of the A. con. env 
gene were optimized based on the codon usage of 
highly expressed human genes. (See Figs. 18A and 
18B for amino acid and nucleic acid sequences, 
respectively.) 

Each pair of the oligos has been amplified, 
cloned, ligated and sequenced. After the open 
reading frame of the A. con env gene was confirmed by 
an In vitro transcription and translation system, 
the A. con env gene was transfected into the 293T 
cells and the protein expression and specificity 
confirmed with the Western blot assay (Figure 18) . 
It was then determined whether A. con envelope is 
biologically functional. It was co- transfected with 
the env-def ective SG3 proviral clone into 293T 
cells. The pseudotyped viruses were harvested and 
used to infect JC53BL cells. Blue cells were 
detected in JC53-BL cells infected with the A. con 
Env-pseudovirions, suggesting that A. con Env protein 
is biologically functional (Table 6) . However, the 
infectious titer of A. con Env-psuedovirions was 
about 7-fold lower than that of pseudovirions with 
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wild- type subtype C envelope (Table 6) . Taken 
together, the biological function A .'con Env proteins 
suggests that it folds correctly and may induce 
linear and conformational T and B cell epitopes if 
5 used as an Env immunogen. 



JC53BL13 (lU/ul) 





3/31/03 


4/7/03 


4/25/03 




non filtered supt. 


0.22|jm filtered 


0.22|jm filtered 


A.con +SG3 


4 


8.5 


15.3 


96ZM651 +SG3 


87 


133 


104 


SG3 backbone 


0 


0.07 


0.03 


Neg control 


0 


0.007 


0 



Table 6. Infectivity of pseudovirons with A.con env genes 



EXAMPLE 6 

io Design of Full Length "Consensus of the Consensus 

gag, pol and nef Genes 11 (M.con.gag, M. con. pol and 
M.con.nef ) and a Subtype C Consensus pol Gene 

(C . con . pol) 

15 For the group M consensus genes, two different 

env genes were constructed, one with virus specific 
variable regions (C0N6) and one with consensus 
variable regions (Con-S) . However, analysis of T 
cell immune responses in immunized or vaccinated 

20 animals and humans shows that the env gene normally 
is not a main target for T cell immune response 
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although it is the only gene that will induce 
neutralizing antibody. Instead, HIV-1 Gag, Pol and 
Nef proteins are found to be important for inducing 
potent T cell immune responses. To generate a 
repertoire of immunogens that can induce both 
broader humoral and cellular immune responses for 
all subtypes, it may be necessary to construct other 
group M consensus genes other than env gene alone. 
"Consensus of the consensus" gag, pol and nef genes 
(M. con. gag., M.con.pol and M. con. nef) have been 
designed. To generate a subtype consensus pol gene, 
the subtype C consensus pol gene (C. con. pol) was 
also designed. The codons . of the M. con. gag., 
M.con.pol, M. con. nef and C. con. pol. genes were 
optimized based on the codon usage of highly 
expressed human genes. (See Fig. 19 for nucleic 
acid and amino acid sequences . ) 



EXAMPLE 7 



Synthetic Subtype B Consensus gag and env Genes 

20 EXPERIMENTAL DETAILS " 

Subtype B consensus gag and env sequences were 
derived from 3 7 and 13 7 contemporary HIV-1 strains, 
respectively, codon-usage optimized for mammalian 
cell expression, and synthesized (Figs. 20A and 

25 2 0B) . To ensure optimal expression, a Kozak 
sequence (GCCGCCGCC) was inserted immediately 
upstream of the initiation codon. in addition to 
the full-length env gene, a truncated env gene was 
generated by introducing a stop codon immediately 
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after the gp41 membrane -spanning domain (IVNR) to 
create a gpl45 gene. Genes were tested for 
integrity in an in vitro transcription/translation 
system and expressed in mammalian cells. (Subtype B 
5 consensus Gag and Env sequences are set forth in 
Figs. 2 0C and 2 0D, respectively.) 

To determine if the subtype B consensus 
envelopes were capable of mediating fusion and 
entry, gpl60 and gpl45 genes were co- transf ected 

10 with an HIV- l/SG3Aenv provirus and the resulting 

pseudovirions were tested for infectivity using the 
JC53-BL cell assay. JC53-BL cells are a derivative 
of HeLa cells that express high levels of CD 4 and 
the HIV-1 coreceptors CCR5 and CXCR4. They also 

is contain the reporter cassettes of lucif erase and fj- 
galactosidase that are each expressed from an HIV-1 
LTR. Expression of the reporter genes is dependent 
on production of HIV-1 Tat. Briefly, cells are 
seeded into 24 -well plates, incubated at 3 7°.C for 24 

20 hours and treated with DEAE-Dextran at 3 7 °C f or - 

3 0min. Virus is serially diluted in 1%. DMEM, added 
to the cells incubating in DEAE-dextran, and allowed 
to incubate for 3 hours at 37°C after which an 
additional 500/zL of cell media is added to each 

25 well. Following a final 48-hour incubation at 37° C, 
cells are fixed, stained using X-Gal, and overlaid 
with PBS for microscopic counting of blue foci. 
Counts for mock- infected wells, used to determine 
background, are subtracted from counts for the 

30 sample wells. Co-receptor usage and envelope 
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neutralization sensitivity were also determined with 
slight modifications of the JC53-BL assay. 

To determine whether the subtype B consensus 
Gag protein was capable of producing virus-like 
particles (VLPs) that incorporated Env 
glycoproteins, 293T cells were co-transf ected with 
subtype B consensus gag and env genes. 48-hours 
post-transfection, cell supernatants containing VLPs 
were collected, clarified in a tabletop centrifuge, 
filtered through a 0 . 2mM filter, and pellet through 
a 2 0% sucrose cushion. The VLP pellet was 
resuspended in PBS and transferred onto a 20-60% 
continuous sucrose gradient.; Following overnight 
centrifugation at 100,000 x g, 0.5 ml . fractions were 
15 collected and assayed for p24 content. The 

refractive index of each fraction was also measured. 
Fractions with the correct density for VLPs and 
containing the highest levels of p 2 4 were pooled and 
pellet a final time. VLP -containing pellets were 
20 re-suspended in PBS and loaded on a 4-20% SDS-PAGE 
gel. Proteins were transferred to a PVDF membrane 
and probed with serum from a subtype B HIV-1 
infected individual . 



. RESULTS 



25 



Codon-usage optimized, subtype B consensus 
envelope {gpl60, gpl45) and gag genes express high 
levels of glycoprotein in mammalian cells (Fig. 21). 

Subtype B gpl60 and gp!45 glycoproteins are 
30 efficiently incorporated into virus particles. 
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Western Blot analysis of sucrose-purified 
pseudovirions suggests at least five- fold higher 
levels of consensus B envelope incorporation 
compared to incorporation of a rev-dependent 
contemporary envelope (Fig.23A). Virions 
pseudotyped with either the subtype B consensus 
gplGO or gpl45 envelope are more infectious than 
pseudovirions containing a rev-dependent 
contemporary envelope (Fig. 23 B) . 

Subtype B consensus envelopes utilize CCR5 as 
the co-receptor to gain entry into CD4 bearing 
target cells (Fig. 22) . 

The infectivity of pseudovirions containing the 
subtype B consensus gpl6 0 envelope was neutralized 
by plasma from HIV-1 subtype B infected patients 
(Fig. 24C) and neutralizing monoclonal antibodies 
(Fig. 24A) . This suggests that the subtype B 
synthetic consensus B envelopes is similar to native 
HIV-1 Env glycoproteins in its overall structure and 
that common neutralization epitopes remain intact. 
Figs. 24B and 24D show neutralization profiles of a 
subtype B control envelope (NL4 . 3 Env) . 

Subtype B consensus Gag proteins are able to 
bud from the cell membrane and form virus-like 
particles (Fig. 25A) . Co-transf ection of the codon- 
optimized subtype B consensus gag and gpl60 genes 
produces VLPs with incorporated envelope (Fig. 2 5B) . 
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CONCLUSIONS 

The synthetic subtype B consensus env and gag 
genes express viral proteins that are similar in 
their structure, function and antigenicity to 
contemporary subtype B Env and Gag proteins. It is 
contemplated that immunogens based on subtype B 
consensus genes will elicit CTL and neutralizing 
immune responses that are protective against a broad 
set of HIV-1 isolates. 



All documents and other information sources 
cited above are hereby incorporated in their 
entirety by reference. Also incorporated by 
reference is Liao et al, J. Virol. 78:5270 (2004)) 
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WHAT IS CLAIMED IS : 

1 . An isolated protein comprising the 
sequence of amino acids set forth in Fig. 1A. 

2. A nucleic acid comprising a nucleotide 
sequence encoding CON6 HIV gpl60 protein, wherein 
said nucleotide sequence comprises codons optimized 
for expression in human cells. 

3. The nucleic acid according to claim 2 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. ID. 

4. A nucleic acid comprising a nucleotide 
sequence encoding subtype C ancestral HIV envelope 
protein, wherein said nucleotide sequence comprises 
codons optimized for expression in human cells. 

5. The nucleic acid according to claim 4 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 6A. 

6. A nucleic acid comprising a nucleotide 
sequence encoding subtype C consensus HIV envelope 
protein, wherein said nucleotide sequence comprises 
codons optimized for expression in human cells. 

7. The nucleic acid according to claim 6 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. SB . 

8. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 6C or 6D . 
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9. A nucleic acid comprising a nucleotide 
sequence encoding a subtype C consensus HIV gag 
protein, wherein said nucleotide sequence comprises 
codons optimised for expression in human cells. 

10. The nucleic acid according to claim 9 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 13E. 

11. A nucleic acid comprising a nucleotide 
sequence encoding a subtype C consensus HIV nef 
protein, wherein said nucleotide sequence comprises 
codons optimized for expression in human cells. 

12. The nucleic acid according to claim 11 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 13F. 

13. A nucleic acid comprising a nucleotide 
sequence encoding Group M consensus HIV envelope 
protein, wherein said nucleotide sequence comprises 
codons optimized for expression in human cells. 

14. The nucleic acid according to claim 13 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 14B. 

15. A nucleic acid comprising a nucleotide 
sequence encoding subtype A consensus HIV envelope 
protein, wherein said nucleotide sequence comprises 
codons optimized for expression in human cells. 
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16. The nucleic acid according to claim 15 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 18B. 

17. A, nucleic acid comprising a nucleotide 
sequence encoding Group M consensus HIV gag protein, 
wherein said nucleotide sequence comprises codons 
optimized for expression in human cells. 

18 . The nucleic acid according to claim 17 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 19A . 

19. A nucleic acid comprising a nucleotide 
sequence encoding Group M consensus HIV pol protein, 
wherein said nucleotide sequence comprises codons 
optimized for expression in human cells. 

20. The nucleic acid according to claim 19 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 19B. 

21. A nucleic acid comprising a nucleotide 
sequence encoding Group M consensus HIV nef protein, 
wherein said nucleotide sequence comprises codons 
optimized for expression in human cells. 

22 . The nucleic acid according to claim 21 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 19C. 

23. A nucleic acid comprising a nucleotide 
sequence encoding subtype C consensus HIV pol 
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protein, wherein said nucleotide sequence comprises 
codons optimized for expression in human cells. 

24 . The nucleic acid according to claim 23 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 19D. 

25. A nucleic acid comprising a nucleotide 
sequence encoding subtype B consensus HIV gag 
protein, wherein said nucleotide sequence comprises 
codons optimized for expression in human cells. 

26. The nucleic acid according to claim 25 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 2 OA. 

27. A nucleic acid comprising a nucleotide 
sequence encoding subtype B consensus HIV envelope 
protein, wherein said nucleotide sequence comprises 
codons optimized for expression in human cells. 

28. The nucleic acid according to claim 27 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 20B. 

29. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 20C or 
2 0D. 

30. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 2GA 
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31. A nucleic acid comprising a nucleotide 
sequence that encodes the protein according to claim 
30. 

32. The nucleic acid according to claim 31 
wherein said nucleic acid comprises the nucleotide 
sequence set forth in Fig. 2 6B. 

33. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 2 8B. 

34. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 33. 

35. The nucleic acid sequence according to 
claim 34 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 28C. 

36. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 2 9B. 

37. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 36. 

38. The nucleic acid sequence according to 
claim 37 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 2 9C. 

39. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 3 OB. 

40. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 39. 
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41. The nucleic acid sequence according to 
claim 40 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 30C. 

42. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 3 IB. 

43 . A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 42. 

44 . The nucleic acid sequence according to 
claim 43 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 31C. 

45. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 32B. 

46. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 45. 

47. The nucleic acid sequence according to 
claim 46 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 32C. 

48. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 3 3B. 

49. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 48. 

50. The nucleic acid sequence according to 
claim 49 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 33C. 
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51. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 34B. 

52. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 51. 

53 . The nucleic acid sequence according to 
claim 52 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 34C. 

54. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 35B. 

55. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 54. 

56. The nucleic acid sequence according to 
claim 55 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 35C. 

57 . An isolated protein comprising the 
sequence of amino acids set forth in Fig. 3 6B. 

58. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 57. 

59. The nucleic acid sequence according to 
claim 58 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 3 6C. 

60. An isolated protein comprising the 
sequence of amino acids set forth in Fig. 3 7B. 

61. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim SO 
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62. The nucleic acid sequence -according to 
claim 61 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 3 7C. 

63 . An isolated protein comprising the 
sequence of amino acids set forth in Fig. 3 8B. 

64. A nucleic acid comprising a nucleotide 
sequence encoding the protein according to claim 63 . 

65. The nucleic acid sequence according to 
claim 64 wherein said nucleic acid comprises the 
nucleotide sequence set forth in Fig. 3 8C. 

66. An isolated protein comprising a CF or CF1 
form of the amino acid sequence set forth in any one 
of Figs. 39A-127A. 

67. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 3 9B. 

68. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 4 OB. 

69. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 41B. 

70. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 42B. 

71. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 43B. 

72. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 44B. 
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73. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 45B. 

74. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 46B. 

75. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 4 7B. 

76. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 48B. 

77. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 49B. 

78. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. SOB. 

79. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 51B. 

80. A nucleic acid comprising the nucleotide 
sequence set forth in Fig, 52B, 

81. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 53B. 

82. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 54B. 

83. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 55B. 

84. A nucleic acid comprising the nucleotide 
sequence set forth in Fig. 56B. 
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85 . 


A nucleic acid comprising 


the 


nucleotide 


sequence 


set forth in Fig. 57B. 






86. 


A nucleic acid comprising 


the 


nucleotide 


sequence 


set forth in- Fig. 58B. 






87. 


A nucleic acid comprising 


the 


nucleotide 


sequence 


set forth in Fig. 59B. 






88 . 


A nucleic acid comprising 


the 


nucleotide 


sequence 


set forth- in Fig. 6 OB. 






89. 


A nucleic acid comprising 


the 


nucleotide 


sequence 


set forth in Fig. 61B. 








A nucleic acid comprising 


the 


nucleotide 


sequence 


set forth in Fig. 62B. 






91 . 


A nucleic acid comprising 


the 


nucleotide 




set forth in any one of Figs. 63B-84B, 65D 


67D and i 


68D. 






92. 


A nucleic acid comprising 


the 


nucleotide 


sequence 


set forth in any one of. Figs. 85B-106B, 


88D, 90D 


and 92D. 






93 . 


A nucleic acid comprising 


the' 


nucleotide 



sequence set forth in any one of Pigs. 107B-127B, 
109D, HID and 112D. 

94. A vector comprising the nucleic acid 
according to any one of claims 2-7, 9-2 8, 31, 32, 
34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 
53, 55, 56, 58, 59, 61, 62, 64, 65 and 67-93. 
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95. A composition comprising at least one 
protein or nucleic acid according to any one of 
claims 1-93 and a carrier. 

96. A method of inducing an immune response in 
a mammal comprising administering to said mammal an 
amount of at least one protein and/or nucleic acid 
according to any one of claims 1-93 sufficient to 
effect said induction. 
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Fig. 1B 



Cleavage Fusion 
site domain 



Fig. 1C 



gp120 



gp140CF [ 



gp160 t 



ft=n 



Fig. 1D 





CON6.env (group M env consensus. This one contain five variable regions in em 
from 98CN006 virus, not in the public domain yet) 

GCC^CCATGCGCGTGATGGGCATCCAGCGC^CTGCCAGCACCTGTGGCGCTGGGGCACCfi 
CTGGGCATGCTGATGATCTGCTCCGCCGCCGAGAACCTGTGGGTGACCGTGTACTACGGC ' 
GTGCCCGTGTGGAAGGAGGCCAACACCACCCTGTTCTGCGCCTCCGACGCCAAGGCCTAC 
GACACCGAGGTGCACAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCC 
CAGGAGATCGTGCTGGAGAACGTGACCGAGAACTTCAACATGTGGAAGAACAACATGGTG 
GAGCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAG 
CTGACCCCCCTGTGCGTGACCCTGAACTGCACCAACGTGCGCAACGTGTCCTCCAACGGC 
ACCGAGACCGACAACGAGGAGATCAAGAACTGCTCCTTCAACATCACCACCGAGCTGCGC 
GACAAGAAGCAGAAGGTGTACGCCCTGTTCTACCGCCTGGACGTGGTGCCCATCGACGAC 
AAGAACTCCTCCGAGATCTCCGGCAAGAACTCCTCCGAGTACTACCGCCTGATCAACTGC 
AACACCTCCGCCATCACCCAGGCCTGCCCCAAGGTGTCCTTCGAGCCCATCCCCATCCAG' 
TACTGCGCCCCCGCCGGCTTCGCCATCCTGAAGTGCAACGACAAGAAGTTCAACGGCACC 
GGCCCCTGCAAGAACGTGTCCACCGTGCAGTGCACCCACGGCATCAAGCCCGTGGTGTCC 
ACCCAGCTGCTGCTGAACGGCTCCCTGGCCGAGGAGGAGATCATCATCCGCTCCGAGAAC 
ATCACCAACAACGCCAAGACCATCATCG TGCAGCTGAACGAGTCCGTGGAGATCAACTGC 
ACCCGCCCCAACAACAACACCCGCAAGTCCATCCACATCGGCCCCGGCCAGGCCTTCTAC 
GCCACCGGCGAGATCATCGGCGACATCCGCCAGGCCCACTGCAACATCTCCCGCACCAAG 
TGGAACAAGACCCTGCAGCAGGTGGCCAAGAAGCTGCGCGAGCACTTCAACAACAAGACC 
ATCATCTTCAAGCCCTCCTGCGGCGGCGACCTGGAGATCACCACCCACTCCTTCAACTGC 
GGCGGCGAGTTCTTCTACTGCAACACCTCCGGCCTGTTCAACTCCACCTGGATGTTCAAC 
GGCACCTACATGTTCAACGGCACCAAGGACAACTCCGAGACCATCACCCTGCCCTGCCGC 
ATCAAGCAGATCATCAACATGTGGCAGGGCGTGGGCCAGGCCATGTACGCCCCCCCCATC 
GAGGGCAAGATCACCTGCAAGTCCAACATCACCGGCCTGCTGCTGACCCGCGACGGCGGC 
AACAACTCCAACAAGAACAAGACCGAGACCTTCCGCCCCGGCGGCGGCGACATGCGCGAC 
AACTGGCGGTGCGAGCTGTACAAGTACAAGGTGGTGAAGATCGAGCCCCTGGGCGTGGCC 
CCCACCAAGGCCAAGCGCCGCGTGGTGGAGCGCGAGAAGGGCGCCGTGGGCATCGGCGCC 
-GTGTTCCTGGGCTTCCTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATCACCCTG 
ACCGTGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGTCCAACCTGCTGCGC 
GCCATCGAGGCCCAGCAGCACCTGCTGCAGCTGACCGTGTGGGGCATCAAGCAGCTGCAG 
GCCCGCGTGCTGGCCGTGGAGCGCTACCTGAAGGACCAGCAGCTGCTGGGCATCTGGGGC 
TGCTCCGGCAAGCTGATCTGCACCACCAACGTGCCCTGGAACTCCTCCTGGTCCAACAAG 
TCCCAGGACGAGATC TGGGACAACATGACCTGGATGGAGTGGGAGCGCGAGATCTCCAAC 
TACACCGACATCATCTACCGCCTGATCGAGGAGTCCCAGAACCAGCAGGAGAAGAACGAG 
CAGGAGCTGCTGGCCCTGGACAAGTGGGCCTCCCTGTGGAACTGGTTCGACATCACCAAC 
TGGCTGTGGTACATCAAGATCTTCATCATGATCGTGGGCGGCCTGATCGGCCTGCGCATC 
GTGTTCGCCGTGCTGTCCATCGTGAACCGCGTGCGCCAGGGCTACTCCCCCCTGTCCTTC : 
CAGACCCTGATCCCCAACCCCCGCGGCCCCGACCGCCCCGAGGGCATCGAGGAGGAGGGC 
GGCGAGCAGGGCCGCGACCGCTCCATCCGCCTGGTGAACGGCTTCCTGGCCCTGGCCTGG 
GACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCACCGCCTGCGCGACTTCATCCTGATC 
GCCGCCCGCACCGTGGAGCTGCTGGGCCGCCGCTCCCTGCGCGGCCTGCAGAAGGGCTGG 
GAGGCCCTGAAGTACCTGGGCAACCTGCTGCAGTACTGGGGCCAGGAGCTGAAGAACTCC 
GCCATCTCCCTGCTGGACACCACCGCCATCGCCGTGGCCGAGGGCACCGACCGCGTGATC 

GAGATCGTGCAGCGCGCCTGCCGCGCCATCCTGAACATCCCCCGCCGCATCCGCCAGGGC 
CTGGAGCGCGCCCTGCTGTAA 
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Fig. 6A 

Oanaenv (subtype C ancestral env. The amino acid sequence is 
different from Los Alamos Database August 2002) — - 

CAAC GTGTG GG CCACC CAGGC CT GC GTG CC CAC CG AC CCCAACCC CP arr 
AGATGGTGCTOGAGAACGTGACCGAGAACTTCAACATGTOGtS 

atcgtggaccagatgcacgaggacatcatctccctgtggSac^SS? 

GAAG CC CTG CG TGAAG CTGAC CC CC CTG TG CGTGA CC CTGAACTG CA CCA 
£COTGACC^CGCCACCAACAACACCTACAACGGC^ 



SJ! TS,^; CAC CA CC GAG CT GC GCG AC AAG AAGAAGAAGGAG TA CG C 
a ^ GCCTG GACAT CG TG CCC CT GAACG AGAACTC CTC CG AG T 

ACCGCCTGATCAACTGCAACACCTCCGCCATC^CCCAGGCCTGCC 



GTGTCCTTCGACCCCATCCCCATCCACTACTGCGCCCCCGCCGGCTACGC 
•^TCCTGAAGTGCAACAACAAC^CCCTCAA^ 

ACGTGTCCA CC GTGCAGTGCACC CACGG CATCAAG CC CGTGGTGT CC r 

cagctgctgctgaacggctccctggccgaggaggagatcatcatccSct 

S^^^ ccga caacgccaagaccatcatcgtccagSg^cgSS 

ccgtggagatcgtgtgcacccgccccaacaacaacacccgcaagtcc5t^ 

cgcatcggccccggccagaccttctacgccaccg^cgacatcISSSgS 

catccgcczaggcccactgcaacatctccgaggacaagtotaacaagaS 

tgcagcaggtggccg^gaagctgggcaagcacttccccaacaagaccS 

ACCTTCGAGCCCTCCTCCGGCGGCGACCTGGAGATCACctcCcS??c?¥ 

^^^ OTGCGAG ^ CTTCTACTCC ^^ C CTCCAAGCTGOTCAAC? 
CC AC CTACAAC AACAA CACCAAC TC CAACT CCACC AT CAC CCTGC CC TV r 

^ G ^^^ G -^® ^ < ^TCATCAACAT GTGGCAGGGCGTGGGCCAGGCCAT OTA 
CGCCCCCCCCATCGCCGGCAACATCACCTGC^GTCCAACATCACCGSr 
TG CTGCT GACC CGCGACGGCGGCAAGGAGAACACCACCGAGACCTTC CGC 

CAAGGTGGTGGAGATCAAGCC CCTGGGCGTGGCCCCCACCGAGGCCA ArC 

gccgcgtggtggagcgcgac^cgcgccgtc^gcctSgg 

CTGG GCTTC CT GGGCG GCGCC GG CT CCA CCATGGG CG CCG CC TCC AT CA C 

cctgaccgtgcaggcccgccagctgctgtccxsgcatcgtgcagcagcSt 

CCAA CCTGC TG CGCGC CATCG AGGC CCAGCAGCAC ATGCTGCAGCTG^AC C' 

ctgtggggcatc^gcagctgcaggcccgcctgc5ggccatcgS?Sct? 
cctcaaggaccagcagctgctgggcatctggggctgctccggc^gcS^ 

^ C ^ CCACCGCCGTGCCGTGG AACT 

gacgacatctgggacaacatgac CTGGATGGAGTGGGACCGCGAGAT CTC 

CAACTACAC CG ACACC ATCTA CC GC CTG CTGGAGGAGTCC CAGAA COAr r 

AGGAGAAGAAC GAG CAGGACC TG CT GGC CC TGGAC TC CTG GG AGAAC CT O 

T GGAA ^, TG ^ < -^^- < -^' T CACC AA.CT GGCTGTGG TAC^TCAAGATCTT CA T 

CATGATCGTGGGCGGCCTGATOSGCCTGCGCATCATCTTCGCCGTGCTGT 
CCATC GT GAA CC GC GTGCGCC AG GG CTA CT CCC CC CTGTC CT TCCAG AC r 

CTGACCCCCAACCCCCGCGGCCCCGACCGCCTGGAGCGCATCGAGGAGGA 
GGGCGG CGAGC AGGAC CGCGA CC GC TCCAT CCG CC TGGTG TC CGGCT TC r 
TGGC CCTGG CC TGGGACGACC TG CG CTC CCTGTGC CTGTT CTCCTAC CA r 

cgcctgcgcgacttcatcctgatcgccgcccgcaccgtggagctgctggS 

G ^ G 5^^ CCCTGCG ^ CCTGC AGCGCGGCTGGGAGGCCCTGAAGTACC 

^ g S^^ tcgtgcagtactggggc ^^agctgaagaagtccgccatc 

TCCC TGCTGGA CAC CATCGCC AT CG CCGTGGCCGAGGGCA CCGACCG CAT 

G^TC?^^S^SSgn^S^5 TG ^ OTCCC ^ C 
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Fig. 6B 

C.con.env (subtype C consensus env. The amino acid sequence 
is different from Los Alamos Database August 2002) 

GCCGCCATGCGCGTGATGGGCATCCTGCGCAACTGCCAGCAGTGGTGGAT 
CTGGGGCATCCTGGGCTTCTGGATGCTGATGATCTGCAACGTGGTGGGCA 
ACCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGAAGGAGGCCAAG 
ACCACCCTGTTCTGCGCCTCCGACGCCAAGGCCTACGAGAAGG AGGTGCA 
CAACGTGTGGGCGACCCACGCCTGCGTGCCCACCGACCCCAACCCCCAGG 
AGATGGTGCTGGAGAACGTGACCGAGAACTTCAACATGTGGAAGAACGAC' 
ATGGTGGACCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTCCCT 
GAAGCCCTGCGTGAAGCTGACCCCCCTGTGCG.TGACCCTGAACTGCCGGA 
ACGTGACCZ^CGCCACCAACAACACCTACAACGAGGAGATCAAG AACTGC 
TCCTTGAACATCACCACCGAGCTGCGCGACAAGAAGAAGAAGGTGTACGC 
CCTGTTCTACCGCCTGGACATCGTGCCCCTGAACGAGAACTCCTCCGAGT 
ACCGCCTGATCAACTGC^^CACCTCCGCCATCACCCAGGCCTGCCCCAAG 
GTGTCCTTCGACCCCATCCCCATCCACTACTGCGCCCCCGCCGGCTACGC 
CATCCTGAAGTGCAACAACAAGACCTTCAACGGCACCGGCCCCTG CAACA 
AGGTGTCCACCGTGCAGTGGACCCACGGCATCAAGCCCGTGGTGTCCACC' 
CAGCTGCTGCTGAACGGGTCCCTGGCCGAGGAGGAGATCATCATCCGCTC 
. CGAGAAC CTGACCAACAACGC CAAGAC CATCATCGTGCACCTGAACGAGT . 
CCGTGGAGATCGTGTGCACCCGCCCCAACAACAACACCCGCAAGTCCATC 
CGCATCGGCCCCGGCCAGACCTTCTACGCCACCGGCGACATCATCG GCGA 
CATCCGCC^GGCCCACTGC^^CATCTCCGAGGACAAGTGGAACAAGACCC 
TGCAGCGCGTGTCCAAGAAGCTGAAGGAGCACTTCCCGAACAAGACCA.TC 
AAGTTCGAGCCCTCCTCCGGCGGCGACCTGGAGATCACCACCCACTCCTT 
CAACTGCCGGGGCGAGTTCTTCTACTGCAACACCTCCAAGCTGTTCAACT 
CCACCTACAACAACAACACCAACTCCAACTCCACCATCACCCTGCCC TGC 
CGCATCAAG CAGATCATCAAGATGTGG CAGGAGGTGGGC CG CGC CATGTA 
CGCCCCCCCCATCGCCGGCAACATCACCTGC^GTCC^CATa\CCGGCC 
' TGCTGCTGACCCGCGACGGGGGCAAGAAGAACACCACCGAGATCTTCCGC 
CC CGGCGGCGGCGACATGCGCGACAACTGGCGCTCCGAGCTGTACAAGTA . 
CAAGGTGGTGGAGATCAAGCCCCTGGGCGTGGCCCCCACCAAGGCCAA GC 
GCCGCGTGGTGGAGCGCGAGAAGCGCGCCGTGGGCATCGGCGCCGTGTTC 
CTGGGCTTCGTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATCAC 
CCTGACCGTGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGT ' 
CCAACCTGCTGCGCGCCATCGAGGCCCAGCAGCACATGCTGCAGCTGACC 
GTGTGGGGCATCAAGCAGCTGCAGACCCGCGTGCTGGCCATCGAGCGCTA 
CCTGAAGGACCAGCAGCTGCTGGGCATCTGGGGCTGCTCCGGCAAGCTGA 
TCTGCACCACCGCCGTGCCGTGGAACTCCTCCTGGTCCAACAAGTGCCAG 
GAGGACATCTGGGACAACATGACCTGGATGCAGTGGGACCGCGAGATCTC 
CAACTACACCGACACCATCTACCGCCTGCTGGAGGACTCCCAGAACCAGC 
AGGAGAAGAACGAGAAGGACCTGCTGGCCCTGGACTCCTGGAAGAACCTG 
TGGAACTGGTTCGACATCACCAACTGGCTGTGGTACATCAAGATCTTCAT 
CATGATCGTGGGCGGCCTGATCGGCCTGCGCATCATCTTCGCCGTGCTGT 
CCATCGTGAACCGCGTGCGCCAGGGCTACTCCCCCCTGTCCTTCCAGACC 
CTGACCCCCAACCCCCGCGGCCCCGACCGCCTGGGCCGCATCGAGGAGGA 
GGGCGGCGAGCAGGACCGCGACCGCTCCATCCGCCTGGTGTCCGGCTTCC 
TGGCCCTGGCCTGGGACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCAC 
CGCCTGCGCGACTTCATCCTGGTGGCCGCCCGCGCCGTGGAGCTGCTGGG 
CCGCTCCTCCCTGCGCGGCCTGCAGCGCGGCTGGGAGGCCCTGAAGTACC 
TGGGCTCCCTGGTGCAGTACTGGGGCCTGGAiSCTGAAGAAGTCCGCCATC 
TCCCTGCTGGACACCATCGCCATCGCCGTGGCCGAGGGCACCGACCGCAT 
CATCGAGCTGATCCAGCGCATCTGCCGCGCCATCCGCAACATCCCCCGCC 
GCArCCGQS^£<3Wi<^ 
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Fig. 6E 

Synthesize entire gene in 80-rner fragments overlapping by 20 residues 
at the 3' end with invariant sequences at the 5' end. 



EcoRl Bbsl 



80mer fragment 



80mer fragment 



BsmBI BamHI 



Invariant Seq. 
(18 nt) 



Invariant Seq. 



Paired 80mer oligos are connected via PCR in a stepwise manner from 5' to 3' using 
primers complimentary to the invariant seq. 



80mer Fragment 



5'primer 



140bp PCR product 



3'primer 



108bp PCR fragments cloned into pGEM-T and sequenced. Clones with the proper sequence 
will be cut with 2 restriction enzymes. 4 fragments will be ligated together with pcDNA3.1 in 
a stepwise manner from the 5* to 3' end of gene 



Fragments to be ligated 

with pcDNA3.1 

(1-4 are in order from 5' to 3') 


Restriction Enzymes 
Used to Cleave 
Fragment 


Fragment 1 


EcoRI/BsmBI 


Fragment 2 


Bbsl/BsmBI 


Fragment 3 


Bbsl/BsmBI 


Fragment 4 


Bbsl/BamHI 


pcDNA3.1 


EcoRI/BamHI 



Fragment 2 



Fragment 1 
EcoRl 




Fragment 3 
Fragment 4 

BamHI 



I 



Ligations will be repeated stepwise 5' to 3' until the entire gene 
has been cloned into pcDNA3.1 
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Fig. 14B 

CONs.env (gorup M consensus env gene. This one contain the 
consensus sequence for variable regions in env gene. The identical 
amino acid sequences as in the public domain) 

GCCGCCGCCATGCGCGTGCGCGGCATCCAGCGCAACTGCCAGCACCTGTG 
GCGCTGGGGCACCCTGATCCTGGGCATGCTGATGATCTGCTCCGCCGCCG 
AGAACCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGAAGGAGGCC 
AACACCACCCTGTTCTGCGCCTCCGACGCCAAGGCCTACGACACCGAGGT 
GCACAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCC 
AGGAGATCGTGCTGGAGAACGTGACCGAGAACTTCAACATGTGGAAGAAC 
AACATGGTGGAGCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTC 
CCTGAAGCCCTGCGTGAAGCTGACCCCCCTGTGCGTGACCCTGAACTGCA 
CCAACGTGAACGTGACCAAC^CCACCAAC^CACCGAGGAGAAGGGCGAG 
AT CAAGAACTGCTC CTTCAACATCACCAC CGAGATC CG CGAC AAGAAGCA 
GAAGGTGTACGCCCTGTTCTACCGCCTGGACGTGGTGCCCATCGACGACA 
ACAAC^ACAACTCCTCCAACTACCGCCTGATCAACTGCAACACCTCCGCC 
ATCACCCAGGCCTGCCCCAAGGTGTCCTTCGAGCCCATCCCCATCCACTA 
CTGCGCCCCCGCCGGCTTCGCCATCCTGAAGTGCAACGACAAGAAGTTCA 
ACGGGACCGGCCCCTGCAAGAACGTGTCCACCGTGCAGTGCACCCACGGC 
ATCAAGCCCGTGGTGTCCACCCAGCTGCTGCTGAACGGCTCCCTGGCCGA 
GGAGGAGATCATCATCCGCTCCGAGAACATCACCAACAACGCCAAGACCA 
T CATCGTGCAG CT GAACGAGTC CGTGGAGATCAACTGCACC C GCCC CAAC 
AACAACACCCGCAAGTCCATCCGCATCGGCCCCGGCCAGGCCTTCTACGC 
CACCGGCGACATCATCGGCGACATCCGCCAGGCCCACTGCAACATCTCCG 
GCACCAAGTGGAACAAGACCCTGCAGCAGGTGGCCAAGAAGCTGCGCGAG 
CACTTCAACAACAAGACCATCATCTTCAAGCCCTCCTCCGGCGGCGACCT 
GGAGATCACCACCCACTCCTTCAACTGCCGCGGCGAGTTCTTCTACTGCA 
ACACCTCCGGCCTGTTCAACTCCACCTGGATCGGCAACGGCACCAAGAAC 
AACAACAAGACCAACGACACCATCACCCTGCCCTGCCGCATCAAGCAGAT 
CATCAACATGTGGCAGGGCGTGGGCCAGGCCATGTACGCCCCCCCCATCG 
AGGGCAAGATCACCTGCAAGTCCAACATCACCGGCCTGCTGCTGACCCGC 
GACGGCGGCAACAACAAGACCAACGAGACCGAGATCTTCCGCCCCGGCGG ~ 
CGGCGACATGCGCGACAACTGGCGCTCCGAGCTGTACAAGTACAAGGTGG 
TGAAGATCGAGCCCCTGGGCGTGGCCCCCACCAAGGCCAAGCGCCGCGTG 
GTGGAGCGCGAGAAGCGCGCCGTGGGCATCGGCGCCGTGTTCCTGGGCTT 
CCTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATCACCCTGACCG 
TGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGTCCAACCTG 
CTGCGCGCCATCGAGGCCCAGCAGCACCTGCTGCAGCTGACCGTGTGGGG 
CATCAAGCAGCTGCAGGCCCGCGTGCTGGCCGTGGAGCGCTACCTGAAGG 
ACCAGCAGCTGCTGGGCATCTGGGGCTGCTCCGGCAAGCTGATCTGCACC ' 
ACCACGGTGCCCTGGAACTCCTCCTGGTCCAACAAGTCCCAGGACGAGAT 
CTGGGACAACATGACCTGGATGGAGTGGGAGCGCGAGATCAACAACTACA 
CCGACATCATCTACTCCCTGATCGAGGAGTCCCAGAACCAGCAGGAGAAG 
AACGAGCAGGAGCTGCTGGCCCTGGACAAGTGGGCCTCCCTGTGGAACTG 
GTTCGACATCACCAACTGGCTGTGGTACATCAAGATCTTCATCATGATCG 
TGGGCGGCCTGATCGGCCTGCGCATCGTGTTCGCCGTGCTGTCCATCGTG 
AACCGCGTGCGCCAGGGCTACTCCCCCCTGTCCTTCCAGACCCTGATCCC 
CAACCCCCGCGGCCCCGACCGCCCCGAGGGCATCGAGGAGGAGGGCGGCG 
AGCAGGACCGCGACCGCTCCATCCGCCTGGTGAACGGCTTCCTGGCCCTG 
GCCTGGGACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCACCGCCTGCG 
CGACTTCATCCTGATCGCCGCCCGCACCGTGGAGCTGCTGGGCCGCAAGG 
GCCTGCGCCGCGGCTGGGAGGCCCTGAAGTACCTGTGGAACCTGCTGCAG 
TACTGGGGCCAGGAGCTGAAGAACTCCGCCATCTCCCTGCTGGACACCAC 
CGCCATCGCCGTGGCCGAGGGCACCGACCGCGTGATCGAGGTGGTGCAGC 
' GCGCCTGCCGCGCCATCCTGAACATCCCCCGCCGCATCCGCCAGGGCCTG 
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Fig. 15A 




Fig. 15B 



Cell lysate Supernatant 

Expression of A.con env gene in mammalian cells 
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Fig. 18B 



A.con.env (subtype A consensus env. Identical amino acid 
sequence to that in the public domain) 

GCCGCCGCCATGCGCGTGATGGGCATCCAGCGCAACTGCGAGCACCTGTG 
GCGCTGGGGCACCATGATCCTGGGCATGATCATCATCTGCTCCGCCGCCG 
AGAACCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGAAGGACGCC 
GAGACCACCCTGTTCTGCGCCTCCGACGCCAAGGCCTACGACACCGAGGT 
GCACAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCC 
AGGAGATCAACCTGGAGAACGTGACCGAGGAGTTCAACATGTGGAAGAAC 
AACATGGTGGAGCAGATGCACACCGACATCATCTCCCTGTGGGACCAGTC 
CCTGAAGCCCTGCGTGAAGCTGACCCCCCTGTGCGTGACCCTGAAGTGCT 
C CAAC GTGAAC G T G AC CAC CAACAT CAC CAACATCAC CGACAACATGAAG 

GGCGAGATCAAGAACTGCTCCTTCAACATGACCACCGAGCTGCGCGACAA 

GAAGCAGAAGGTGTACTCCCTGTTCTACAAGCTGGACGTGGTGCAGATCA 

ACAAGTCCAACTCCTCCTCCCAGTACCGCCTGATCAACTGCAACACCTCC 

GCCATCACCCAGGCC TGCCCCAAGGTGTCCTTCGAGCCCATCCCGATCCA 

CTACTGCGCCCCCGCCGGCTTCGCCATCCTGAAGTGCAAGGACAAGGAGT 

TCAACGGCACCGGCCCCTGCAAGAACGTGTCCACCGTGCAGTGCACCCAC 

GGCATCAAGCCCGTGGTGTCCACCCAGCTGCTGCTGAACGGCTCCCTGGC 

CGAGGAGGAGGTGATGATCCGCTCCGAGAACATCACCAACAACGCCAAGA 

ACATCATCGTGCAGCTGACCAAGCCCGTGAAGATCAACTGCACCCGCCCC 

AACAACAACACCCGCAAGTCCATCCGCATCGGCCCCGGCCAGGCCTTCTA 

CGCCACCGGCGACATCATCGGCGACATCCGCCAGGCCCACTGCAACGTGT 

CCCGCACCGAGTGGAACGAGACCCTGCAGAAGGTGGCCAAGCAGCTGCGC 

AAGTACTTCAACAACAAGACCATCATCTTCACCAACTCCTCCGGCGGCGA 

CCTGGAGATCACCACCCACTCCTTCAACTGCGGCGGCGAGTTCTTCTACT 

GCAACACCTCCGGCCTGTTCAACTCCACCTGGAACGGCAACGGCACCAAG 

AAGAAGAACTCCACCGAGTCCAACGACACCATCACCCTGCCCTGCCGCAT 

CAAGCAGATCATCAACATGTGGCAGCGCGTGGGCCAGGCCATGTACGCCC 

CCCCCATCCAGGGCGTGATCCGCTGCGAGTCCAACATCACCGGCCTGCTG 

CTGACCCGCGACGGCGGCGACAACAACTCCAAGAACGAGACCTTCCGCCC 

CGGCGGCGGCGACATGCGCGACAACTGGCGCTCCGAGCTGTACAAGTACA 

AGGTGGTGAAGATCGAGCCCCTGGGCGTGGCCCCCACCAAGGCCAAGCGC 

CGCGTGGTGGAGCGCGAGAAGCGCGCCGTGGGCATCGGCGCCGTGTTCCT 

GGGCTTCCTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATCACCC 

TGACCGTGCAGGCCCGCCAGCTGCTGTCCGGCATCGTGCAGCAGCAGTCC 

AACCTGCTGCGCGCCATCGAGGCCCAGCAGCACCTGCTGAAGCTGACCGT 

GTGGGGCATCAAGCAGCTGCAGGCCCGCGTGCTGGCCGTGGAGCGCTACC 

TGAAGGACCAGCAGCTGCTGGGCATCTGGGGCTGCTCCGGCAAGCTGATC 

TGCACCACCAACGTGCCCTGGAACTCCTCCTGGTCCAACAAGTCCCAGTC 

CGAGATCTGGGACAACATGACCTGGCTGCAGTGGGACAAGGAGATCTCCA 

ACTACACCGACATCATCTACAACCTGATCGAGGAGTCCCAGAACCAGCAG 

GAGAAGAACGAGCAGGACCTGCTGGCCCTGGACAAGTGGGCCAACCTGTG 

GAACTGGTTCGACATCTCCAACTGGCTGTGGTACATCAAGATCTTCATCA 

TGATCGTGGGCGGCCTGATCGGCCTGCGCATCGTGTTCGCCGTGCTGTCC 

GTGATCAACCGCGTGCGCCAGGGCTACTCCCCCCTGTCCTTCCAGACCCA 

CACCCCCAACCCCGGCGGCCTGGACCGCCCCGGCCGCATCGAGGAGGAGG 

GCGGCGAGCAGGGCCGCGACCGCTCCATCCGCCTGGTGTCCGGCTTCCTG 

GCCCTGGCCTGGGACGACCTGCGCTCCCTGTGCCTGTTCTCCTACCACCG 

CCTGCGCGACTTCATCCTGATCGCCGCCCGCACCGTGGAGCTGCTGGGCC 

ACTCCTCCCTGAAGGGCCTGCGCCTGGGCTGGGAGGGCCTGAAGTACCTG 

TGGAACCTGCTGCTGTACTGGGGCCGCGAGCTGAAGATCTCCGCCATCAA 

CCTGCTGGACACCATCGCCATCGCCGTGGCCGGCTGGACCGACCGCGTGA 

TCGAGATCGGCCAGCGCATCTGCCGCGCCATCCTGAACATCCCCCGCCGC 

ATCCGCQ^^fOTJ5t3W3ClKG^C3lEtrrOT^ p 
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Fig. 18C 




Fig. 18D 



Cell lysate Supernatant 

Expression of Axon env gene in mammalian cells 



M.con.gag (group M consensus gag. Identical amino acid sequence 
to that in the public domain) 

GCCGCCGCCATGGGCGCCCGCGCCTCCGTGCTGTCCGGCGGCAAGCTGGA 
CGCCTGGGAGAAGATCCGCCTGCGCCCCGGCGGCAAGAAGAAGTACCGCC 
TGAAGCACCTGGTGTGGGCCTCCCGCGAGCTGGAGCGCTTCGCCCTGAAC- 
CCCGGCCTGCTGGAGACCTCCGAGGGCTGCAAGCAGATCATCGGCCAGCT 
GCAGCCCGCCCTGCAGACCGGCTCCGAGGAGCTGCGCTCCCTGTACAACA 
CCGTGGCCACQCTGTACTGCGTGCACCAGCGCATCGAGGTGAAGGACACC . 
AAGGAGGCCCTGGAGAAGATCGAGGAGGAGCAGAACAAGTCCCAGCAGAA ' 
GACCCAGCAGGCCGCCGCCGACAAGGGCAACTCCTCCAAGGTGTCCCAGA 
ACTACCCCATCGTGCAGAACCTGCAGGGCCAGATGGTGCACCAGGCCATC 
TCCCCCCGCACCCTGAACGCCTGGGTGAAGGTGATCGAGGAGAAGGCCTT ' 
Fid 1 0A CTCCCCCGAGGTGATCCCCATGTTCTCCGCCCTGTCCGAGGGCGCCACCC 
I /y. / C7/1 CCCAGGACCTGAACACCATGCTGAACACCGTGGGCGGCCACCAGGCCGCC 
ATGCAGATGCTGAAGGACACCATCAACGAGGAGGCCGCCGAGTGGGACCG 
CCTGCACCCCGTGCACGCCGGCCCCATCCCCCCCGGCCAGATGCGCGAGC 
CCCGCGGCTCCGACATCGCCGGCACCACCTCCACCCTGCAGGAGCAGATC 
GCCTGGATGACCTCCAACCCCCCCATCCCCGTGGGCGAGATCTACAAGCG 
CTGGATCATCCTGGGCCTGAACAAGATCGTGCGCATGTACTCCCCCGTGT 
CCATCCTGGACATCCGCCAGGGCCCCAAGGAGCCCTTCCGCGACTACGTG 
GACCGCTTCTTCAAGACCCTGCGCGCCGAGCAGGCCACCCAGGACGTGAA 
GAACTGGATGACCGACACCCTGCTGGTGCAGAACGCCAACCCCGACTGCA 
AGACCATCCTGAAGGCCCTGGGCCCCGGCGCCACCCTGGAGGAGATGATG 
ACCGCCTGCCAGGGCGTGGGCGGCCCCGGCCACAAGGCCCGCGTGCTGGC 
CGAGGCCATGTCCCAGGTGACCAACGCCGCCATCATGATGCAGCGCGGCA 
ACTTCAAGGGCCAGCGCCGCATCATCAAGTGCTTCAACTGCGGCAAGGAG 
GGCCACATCGCCCGCAACTGCCGCGCCCCCCGCAAGAAGGGCTGCTGGAA 
GTGCGGCAAGGAGGGCCACCAGATGAAGGACTGCACCGAGCGCCAGGCCA 
ACTTCCTGGGCAAGATCTGGCCCTCCAACAAGGGCCGCCCCGGCAACTTC 
CTGCAGTCCCGCCCCGAGCCCACCGCCCCCCCCGCCGAGTCCTTCGGCTT 
CGGCGAGGAGATCACCCCCTCCCCCAAGCAGGAGCCCAAGGACAAGGAGC 
CCCCCCTGACCTCCCTGAAGTCCCTGTTCGGCAACGACCCCCTGTCCCAG 
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GCCGCCGCCATGCCCCAGATCACCCTGTGGCAGCGCCCCCTGGTC3ACCAT J— ^or-» 

CAAGATCGGCGGCCAGCTGAAGGAGGCCCTGCTGGCCAC03GCGCCGACG F/C7. 1 9B 

ACACCGTGCTGGAGGAGATCAACCTGCCCGGCAAGTGGAAGCCCAAGATG ^ 

ATCGGCGGCATCGGCGGCTTCATCAAGGTGCGCCAGTACGACCAGATCCT 

GATCGAGATCTGCGGCAAGAAGGCCATCGGCACCGTGCTGGTGGGCCCCA 

CCCCCGTGAACATCATCGGGCGCAACATGCTGACCCAGATCGGCTGCACC 

CTGAACTTCCCCATCTCCCCCATCGAGACCGTGCCCGTGAAGCTGAAGCC 

CGGCATGGACGGCCCCAAGGTGAAGCAGTGGCCCCTGACCGAGGAGAAGA 

TCAAGGCCCTGACCGAGATCTGCACCGAGATGGAGAAGGAGGGCAAGATC 

TCCAAGATCGGCCCCGAGAACCCCTACAACACCCCCATCTTCGCCATCAA 

GAAGAAGGACTCCACCAAG1GGCGCAAGCTGGTGGACTTCCGCGAGCTGA 

ACAAGCGCACCCAGGACTTCTGGGAGGTGCAGCTGGGCATCCCCCACCCC 

GCCGGCCTGAAGAAGAAGARGTCCGTGACCGTGCTGGACGTGGGCGAOGC 

CTACTTCTCCGTGCCCCTGGACGAGGACTTCCGCAAGTACACCGCCTTCA 

CCATCCCCTCCATCAACAAOSAGACCCCCGGCATCCGCTACCAGTACAAC 

GTGCTGCCCCAGGGCTGGAAGGGCTCCCCCGCCATCTTCCAGTCCTCCAT 

GACCAAGATCCTGGAGCCCTrCCGCACCCAGAACCCCGAGATCGTGATCT 

ACCAGTACATGGACGACCTGTACGTGGGCTCCGACCTGGAGATCGGCCAG 

CACCGCGCCAAGATCGAGGAGCTGCGCGAGCACCTGCTG03CTGGGGCTT 

CACCACCCCCGACAAGAAGCACCAGAAGGAGCCCCCCTTCCTGTGGATGG 

GCTACGAGCTGCACCCCGACAAGTGGACCGTGCAGCCCATCCAGCTGCCC 

GAGAAGGACTCCTGGACCGT3AACGACATCCAGAAGCTGGTGGGCAAGCT 

GAACTGGGCCTCCCAGATCTACCCCGGCATCAAGGTGAAGCAGCTGTGCA 

AGCTGCTGCGCGGCGCCAAGGCCCTGACCGACATCGTGCCXZCTGACCGAG 

GAGGCCGAGCTGGAGCTGGCJCGAGAACCGCGAGATCCTGAAGGAGCCCGT 

GCACGGCGTGTACTACGAC0CCTCCAAGGACCTGATCGCO3AGATCCAGA 

AGCAGGGCCAGGACCAGTGGACCTACCAGATCTACCAGGAGCCCTTCAAG 

AACCTCAAGACCGGCAAGTACGCCAAGATGCGCTCCGCCCACACCAACGA 

CGTGAAGCAGCTGACCGAGGCCGTGCAGAAGATCGCCACaGAGTCCATCG 

TGATCTGGGGCAAGACCCCCAAGTTCCGCCTGCCCATCCAGAAGGAGACC 

TGGGAGACCTGGTGGACCGAGTACTGGCAGGCCACCTGGATTCCCGAGTG 
GGAGTTCGTGAACACCCCCOCCCTGGTGAAGCTGTGGTACCAGCTGGAGA 
AGGAGCCCATCGCCGGCGCCGAGACCTTCTACGTGGACGGCGCCGCCAAC 
CGCGAGACCAAGCTGGGCAAGGCCGGCTACGTGACCGACCECGGCCGCCA 
GAAGGTGGTGTCCCTGACCGAGACCACCAACCAGAAAACaGAGCTGCAGG 
CCATCCACCTGGCCCTGCAGGACTCCGGCTCCGAGGTGAACATCGTGACC 
GACTCCCAGTACGCCCTGGGCATCATCCAGGCCCAGCCCGACAAGTCCGA 
GTCCGAGCTGGTGAACCAGATCATCGAGCAGCTGATCAAGAAGGAGAAGG 
TGTACCTGTCCTGGGTGCC03CCCACAAGGGCATCGGCGGCAACGAGCAG 
GTGGACAAGCTGGTGTCCACCGGCATCCGCAAGGTGCTGTTCCTGGACGG 
CATCGACAAGGCCCAGGAGGAGCACGAGAAGTACCACTCCAACTGGCGCG 
CCATGGCCTCCGACTTCAAGCTGCCCCCCATCGTGGCCAAGGAGATCGTG 
GCCTCCTGCGACAAGTGCCAGCTGAAGGGCGAGGCCATGCACGGCCAGGT 
GGACTGCTCCCCCGGCATCTCGCAGCTGGACTGCACCCACCTGGAGGGCA 
AGATCATCCTGGTGGCCGTGCACGTGGCCTCCGGCTACATCGAGGCCGAG 
GTGATCCCCGCCGAGACCGGCCAGGAGACCGCCTACTTCATCCTGAAGCT 
GGCCGGCCGCTGGCCCGTGAAGGTGATCCACACCGACAA03GCTCCAACT 
TCACCTCCGCCGCCGTGAAGGCCGCCTGCTGGTGGGCCGGCATCCAGCAG 
GAGTTCGGCATCCCCTACAACCCCCAGTCCCAGGGCGTGGTGGAGTCCAT 
GAACAAGGAGCTGAAGAAGATCATCGGCCAGGTGCGCGACCAGGCCGAGC 
ACCTCAAGACCGCCGTGCAGATGGCCGTGTTCATCCACAACTTCAAGCGC 
AAGGGCGGCATCGGCGGCTACTCCGCCGGCGAGCGCATCATCGACATCAT 
CGCCACCGACATCCAGACCAAGGAGCTGCAGAAGCAGATCACCAAGATCC 
AGAACTTCCGCGTGTACTACCGCGACTCCCGCGACCCCATCTGGAAGGGC 
CCCGCCAAGCTGCTGTGGAAGGGCGAGGGCGCCGTGGTGATCCAGGACAA 
CTCCGACATCAAGGTGGTGCCCCGCCGCAAGGCCAAGATCATCCGCGACT 
ACGGC^GCAGATGGCCGjSOGACGACTGCGTGGCCGGCCGCCAGGACGAG 
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Fig. 19C 



M.con.nef (group M consensus nef. Identical amino acid sequence 
to that in the public domain) 

GCCG CCGCC AT GGG CG GCAAG TG GT CCA AG TCCTC CATCGTGGGC TG GC C 
CGCCGT GCG CGAGC GC AT CCG CC GCACC CA CCCCGCCGCCGAGGG CG TG G 
GCG CCGTGTC CCAGG AC CTGG AC AAGCACGGCGCC ATCAC CT CCT CCAAC 
ACCG CCGCC AACAACC CCGAC TG CG CCT GG CTGGAGG CCC AGGAG GAGG A 
GG AG GAGGT GG GCT TC CC CGT GC GC CCC CAGGTGC CC CTG CG CCC CA TG A 
CCTACAAGG CCGCC CT GGACC TG TC CCA CT TCC TG AAGGA GA AGG GC GG C 
CTGG AGGGC CT GAT CT ACTCC AAGAAGCGC CAGGAGATCC TG GAC CT GT G 
GGTG TACCA CA CCC AG GG CTACT TC CCC GA CTGGC AGAAC TACAC CC CC G 
GCCC CGGCATC CGCTACCCCCTG AC CTTCGGCTGGTGCTT CAAGC TGGT G 
CCCGTGGACCC CGAGG AGGTGGAGG AGG CCAACGAGGGCG AGAAC AACT C 
CCTGCTGCACC CCATG TGCCAGC AC GGCATGGAGGACGAGGAGOGCG AGG 
TG CT GATGT GG AAG TT CG ACT CC CG CCTGGCCCTGCGCCACATCG CC CG C 
GAG CT GCACC CC GAG TA CT ACAA GG ACT GC TAA ; 



C.con.pol.nuc 

GCCGCCGCCATGCCCCAGATCACCCTGTGGCAGCGCCCCCTGGTGTCCAT 
CAAGGTGGGCGGCCAGATCAAGGAGGCCCTGCTGGCCACCGGCGCCGACG 
ACACCGTGCTGGAGGAGATCAACCTGCCCGGCAAGTGGAAGCCCAAGATG 
ATCGGCGGCATCGGCGGCTTCATCAAGGTGCGCCAGTACGACCAGATCCT 
GATCGAGATCTGCGGCAAGAAGGCCATCGGCACCGTGCTGGTGGGCCCCA 
CCCCCGTGAACATCATCGGCCGCAACATGCTGACCCAGCT3GGCTGCACC 
CTGAACTTCCCCATCTCCCCCATCGAGACCGTGCCCGTGAAGCTGAAGCC 
CGGCATGGACGGCCCCAAGCTGAAGCAGTGGCCCCTGACOGAGGAGAAGA 
TCAAGGCCCTGACCGCCATCTGCGAGGAGATGGAGAAGGAGGGCAAGATC 
ACCAAGATCGGCCCCGAGAACCCCTACAACACCCCCGTCTTCGCCATCAA 
GAAGAAGGACTCCACCAAGTGGCGCAAGCTGGTGGACTTCCGCGAGCTGA 
ACAAGCGCACCCAGGACTTCTGGGAGGTGCAGCTGGGCATCCCCCACCCC 
GCCGGCCTGAAGAAGAAGAAGTCCGTGACCGTGCTGGACGTGGGCGACGC 
CTACTTCTCCGTGCCCCTGGACGAGGGCTTCCGCAAGTACACCGCCTTCA 
CCATCCCCTCCATCAACAAOGAGACCCCCGGCATCCGCTACCAGTACAAC 
GTGCTGCCCCAGGGCTGGAAGGGCTCCCCCGCCATCTTCCAGTCCTCCAT 
GACCAAGATCCTGGAGCCCTTCCGCGCCCAGAACCCCGAGATCGTGATCT 
ACCAGTACATGGACGACCTGTACGTGGGCTCCGACCTGGAGATCGGCCAG 
CACCGCGCCAAGATCGAGGAGCTGCGCGAGCACCTGCTGAAGTGGGGCTT 
CACCACCCCCGACAAGAAGCACCAGAAGGAGCCCCCCTTCCTGTGGATGG 
GCTACGAG'CTGCACCCCGACAAGTGGACCGTGCAGCCCATCCAGCTGCCC 
GAGAAGGACTCCTGGACCGTCAACGACATCCAGAAGCTGGTGGGCAAGCT 
GAACTGGGCCTCCCAGATCTACCCCGGCATCAAGGTGCGCCAGCTGTGCA 
AGCTGCTGCGCGGCGCCAAGGCCCTGACCGACATCGTGCCCCTGACCGAG 
GAGGCCGAGCTGGAGCTGGOCGAGAACCGCGAGATCCTGAAGGAGCCCGT 
GCACGGCGTGTACTACGACOCCTCCAAGGACCTGATCGCOGAGATCCAGA 
AGCAGGGCCACGACCAGTGGACCTACCAGATCTACCAGGAGCCCTTCAAG 
AACCTCAAGACCGGCAAGTACGCCAAGATGCGCACCGCCCACACCAACGA 
CGTGAAGCAGCTGACCGAGGCCGTGCAGAAGATCGCCATGGAGTCCATCG 
TGATCTGGGGCAAGACCCCCAAGTTCCGCCTGCCCATCCAGAAGGAGACC 
TGGGAGACCTGGTGGACCGACTACTGGCAGGCCACCTGGATTCCCGAGTG 
GGAGTTCGTGAACACCCCCCCCCTGGTGAAGCTGTGGTACCAGCTGGAGA 
AGGAGCC<SmE^mTO<3miGi»J^«ETCT^^ 
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Fig. 20A 

B.con.gag (subtype B consensus gag. The amino acid sequence is 
different from Los Alamos Database August 2002) 

GCCGCCGCCATGGGCGCCCGCGCCTCCGTGCTGTCCGGCGGCGAGCTGGA 
CCGCTGGGAGAAGATCCGCCTGCGCCCCGGCGGCAAGAAGAAGTACAAGC 
TGAAGCACATCGTGTGGGCCTCCCGCGAGCTGGAGCGCTTCGCCGTGAAC 
CCCGGCCTGCTGGAGACCTCCGAGGGCTGCCGCCAGATCCTGGGCGAGCT 
GCAGCCCTCCCTGCAGACCGGCTCCGAGGAGCTGCGCTCCCTGTACAACA 

•.CCGTGGCCACCCTGTACTGCGTGCACCAGCGCATCGAGGTGAAGGACACC 
AAGGAGGCCCTGGAGAAGATCGAGGAGGAGCAGAACAAGTCCAAGAAGAA 
GGCCCAGCAGGCCGCCGCCGACACCGGCAACTCCTCCCAGGTGTCCCAGA 
ACTACCCCATCGTGCAGAACCTGCAGGGCCAGATGGTGCACCAGGCCATC 
TCCCCCCGCACCCTGAACGCCTGGGTGAAGGTGGTGGAGGAGAAGGCCTT 
CTCCCCCGAGGTGATCCCCATGTTCTCCGCCCTGTCCGAGGGGGCCACCC 
CCCAGGACCTGAACACCATGCTGAACACCGTGGGCGGCCACCAGGCCGCC 
ATGCAGATGCTGAAGGAGACCAT'CAACGAGGAGGCCGCCGAGTGGGACCG 
CCTGCACCCCGTGCACGCCGGCCCCATCGCCCCCGGCCAGATGCGCGAGC 
CCCGCGGCTCCGACATCGCCGGCACCACCTCCACCCTGCAGGAGCAGATC 
GGCTGGATGACCAACAACCCCCCCATCCCCGTGGGCGAGATCTACAAGCG 
CTGGATCATCCTGGGCCTGAACAAGATCGTGCGCATGTACTCCCCCACCT 
CCATCCTGGACATCCGCCAGGGCCCCAAGGAGCCCTTCCGCGACTACGTG 
GAGCGCTTCTACAAGACCCTGCGCGCCGAGCAGGCCTCCCAGGAGGTGAA 
GAACTGGATGACCGAGACCCTGCTGGTGCAGAACGCCAACCCCGACTGCA 
AGACCATCCTGAAGGCCCTGGGCCCCGCCGCCACCCTGGAGGAGATGATG 
ACCGCCTGCCAGGGCGTGGGCGGCCCCGGCCACAAGGCCCGCGTGCTGGC 
CGAGGCCATGTCCCAGGTGACCAACTCCGCCACCATCATGATGCAGCGCG 
GCAACTTCCGCAACCAGCGCAAGACCGTGAAGTGCTTCAACTGCGGCAAG 
GAGGGCCACATCGCCAAGAACTGCCGCGCCCCCCGCAAGAAGGGCTGCTG 

• GAAGTGCGGCAAGGAGGGCCACCAGATGAAGGACTGCACCGAGCGCCAGG ' 

CCAACTTCCTGGGCAAGATCTGGCCCTCCCACAAGGGCCGCCCCGGCAAC 

TTCCTGCAGTCCCGCCCCGAGCCCAGCGCCCCCCCCGAGGAGTCCTTCCG 

CTTCGGCGAGGAGACCACCACCCCCTCCCAGAAGCAGGAGCCCATCGACA 

AGGAGCTGTACCCCCTGGCCTCCCTGCGCTCCCTGTTCGGCAACGACCCC 
TCCTCCCAGTAA ' 
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Fig. 20B 

B.con.env (subtype B consensus env. The amino acid sequence is 
different from Los Alamos Database August 2002) 

GC CG CC GCCAT GCG CG TGAAG GG CATCCGC AAGAA CTACC AG CAC CTGTG 
GCGCTGGGGCACCATGCTGCTGGGCATGCTGATGATCTGCTCCGCCGCCG 
AGAAG CTGTGGG TGACC GTGTACT AC GG CGTGCCCGTGTGGAAGG AGGC C 
ACCACCACC CTGTTCTGCGCCTC CGACG CCAAGGC CTACG AC ACC GAGG T 
GCACAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCC 
AGGAGGTGGTGCTGGAGAACGTGACCGAGAACTTCAACATGTGGAAGAAC 
AACATGGTGGAGCAGATGCAC GAGGACATC ATCTC CCTGTGGGAC CAGT C 
CCTGAAGCCCTGCGTGAAGC TGACC CCCCTGTGCG TGACC CTGAACTGCA 
CCGACCTGAAGAACAACCTGC TGAACAC CAACTCCTC CTC CGGCGAGAAG 
AT GG AG AAG GG CGAGATC AAGAA CT GCT CC TTC AA CATCA CC ACC TC CA T 
CCGCGACAAGGTGCAGAAGGAGTACGCC CTGTTCTACAAGCTGGACGTGG 
TG CC CATCG AC AACAA CAACAAC AC CTC CTACCGC CTGAT CTCCTGCAAC 
AC CT CCGTG ATCAC CCAGGCCTGCCC CAAGGTGTC CTTCGAGCCCATCCC 
CATC CACTACTGCG CC CC CGC CG GCTTCGC CATCC TGAAG TG CAA CGACA 
AGAAGT TCAAC GGC AC CGGCC CC TG CAC CAACGTG TC CAC CGTGC AG TG C 
AC CC AC GGC AT CCG CC CC GTG GT GT CCA CC CAG CT GC TGC TG AAC GG CT C 
CCTG GC CGAGG AGGAGGTGGT GATC CGC TC CGAGA ACTTC AC CGA CAAC G 
C CAAG ACC AT CATCG TG CAG CTGAACGAGTCCGTGGAGAT CAACTGCAC C 
CGCC CCAACAACAACACCCGC AAGT CCATC CACAT CGGCC CCGGC CGCG C 
CTTCTACACCACCGGCGAGAT CATC GGC GACATCCGCCAGGC CCA CTGCA 
ACAT CT CCC GC GCC AAGTGGAAC AACAC CC TGAAG CAGAT CG TGAAGAAG 
CTGC GC GAG CAGTT CG GC AAC AAGA CCATC GTGTT CAACC AG TCC TC CG G 
CGGC GA CCC CGAGATC GT GAT GC ACT CC TT CAACT GCGGC GG CGAGT TC T 
TCTACTGCAAC ACCAC CCAGC TG TT CAACT CCACC TGGAACGACAAC GG C 
AC CT GGAAC AA CAC CAAG GAC AAGAACA CCATC AC CCTGC CCTGC CGCAT 
CAAG CAGAT CATCAAC ATGTG GC AG GAG GT GGGCAAGGCC ATGTA CG CC C 
CCCCCATCCGCGGCCAGATCCGCTGCTCCTCCAACATCACCGGCCTGCTG 
CTGAC CCGCGAC GGCGG CAACAAC AAC AACGACAC CGAGATC TTC CGCC C 
CGGC GG CGG CG ACATG CGCGA CAAC TGG CG CTCCG AG CTG TACAAGTAC A 
AGGTGGTGAAGATCGAGCCCC TG GG CGTGG CCCCCACCAAGG CCAAG CG C 
CG CG TG GTG CAGCG CG AG AAG CG CGCCGTGGGCAT CGGCG CCATGTTCCT 
GGGC TT CCT GG GCG CC GC CGG CT CCACC AT GGG CG CCGCCTC CATGACC C 
TGAC CG TG CAG GC CCGCC AG CTG CT GTC CG GC ATC GTGCAGC AGC AGAA C 
AACC TG CTG CG CGC CATC GAG GC CCAGC AG CACCT GCTGC AG CTGAC CG T 
GTGGGGCAT CAAGCAGCTGCAGG CC CGC GTGCTGG CCGTGGAGCG CTAC C 
TG AAGG ACC AG CAG CT GC TGG GC AT CTG GG GCTGC TC CGG CAAGC TG AT C 
TGCACCACC AC CGTGC CCTGGAA CG CCT CC TGGTC CAACAAGTCC CTGG A 
CGAGATCTGGGACAACATGACCTGGATGGAGTGGGAGCGCGAGATCGACA 
ACTACACCTCC CTGAT CTACACCCT GAT CGAGGAGTCCCAGAACC AG CAG 
GAGAAGAAC GAGCAGGAG CTG CT GG AGCTGGACAAGTGGG CC TCC CTGT G 
GAACTGGTT CGACATCAC CAA CTGG CTG TGGTACATCAAG AT CTT CATCA 
TGATCGTGGGCGGCCTGATCGGCCTGCGCATCGTGTTCGCCGTGCTGTCC 
ATCGTGAACCGCGTGCGCCAGGGCTACTCC CCCCTGTCCTTCCAGACCCG 
CCTG CC CGC CC CCCGC GG CCC CG AC CGC CC CGAGGGCATC GAGGAGG AG G 
GC GG CG AGC GC GAC CG CGACC GC TC CGG CCGCC TG GTGGA CG GCT TC CT G 
GCCCTGATC TG GGACGACCTG CG CT CCC TGTGCCTGTTCT CCTAC CACC G 
CCTG CGCGA CCTGCTGCTGATCG TGACC CG CATCG TGGAGCTGCTGGGCC 
GC CG CGG CT GG GAG GT GCTGA AG TAC TG GTG GAAC CTGCT GC AGT AC TG G 
TCCCAGGAGCTGAAGAACTCCGCCGTGTCCCTGCTGAACGCCACCGCCAT 
CGCCGT GGC CGAGGGCACCGACCGCGTGAT CGAGG TGGTG CAGCG CGCCT 
GC CG CG CCATC CTGCA CATCC CC CG CCG CATCCGC CAGGG CC TGG AG CG C 
GC CCTGC TGTAA 
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Fig. 20B 

B.con.env (subtype B consensus env. The amino acid sequence is 
different from Los Alamos Database August 2002) 

GC CG CCGCC AT GCG CG TG AAG GG CATCC GC AAGAA CT ACC AG CAC CT GTG 
GCGCTGGGG CACCATG CTGCTGGGC ATG CTGATGATCTGCTC CGC CG CC G 
AG AAG CT GTG GG TG ACC GT GTA CT AC GG CG TGC C C GTGTG GAAGG AG GC C 
AC CA CC ACC CT GTT CT GCGCC TC CG ACG CC AAG GC CTACG AC ACC GA GG T 
GCACAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCC 
AGGAGGTGGTGCTGGAGAACGTGACCGAGAACTTCAACATGTGGAAGAAC 
AACATGGTGGAGCAGATGCAC GAGG ACATCATCTC CCTGT GGGAC CAGT C 
C CT GAA GC CCTGC GT GAAGC TGA CC CC CCT GTG CG TGACC CT GAA CT GC A 
CCGACCTGAAG AACAACCTGCTGAACAC CAACTCCTCCTC CGGCG AGAAG 
ATGG AG AAG GG CGAGA TC AAG AA CT GCT CC TTC AA CATCA CC ACC TC CAT 
CCGC GACAAGG TGC AG AAGGA GT AC GCC CT GTT CT ACAAG CT GGA CG TG G 
TGCC CATCG AC AACAA CAACAAC AC CTC CTACCGCCTGAT CT CCTGC AAC 
AC CT CCGTGATCAC CCAGG CCTGC CC CAAGGTGTC CTTCGAG CCCAT CC C 
CATC CACTA CT GCG CC CC CGC CG G C TTC GC CAT C C TG AAG TG CAA CG AC A 
AG AAGT TCAAC GGC AC CG GCC CC TG CAC CA ACG TG TC CAC CG TGC AG TG C 
AC C C AC GGC AT CCG CC CCGTG GT GT CCA CC CAG CT GC TGC TG AAC GG CT C 
CCTG GC CGA GG AGG AG GTGGT GATC CGC TC CGAGAACTTC AC CGACAACG 
C CAAGACC AT CAT CG TG CAG CT GAA CG AG TCCG TG GAGAT CAACTGCAC C 
CGCG CC AAC AACAA CA CC CGC AAGT CCATC CACATCGGCC CCGGC CG CG C * 

CTTCTA CAC CACCGGC GAGAT CATCGGCGACATCCGCCAGGC CCA CTGCA 
ACAT CTCCCGCGCCAAGTGGAAC AACAC CC TGAAG CAGAT CGTGAAG AAG 
CT GC GCGAG CAGTT CG GC AAC AAGA CCATC GTG TT CAACC AG TCC TC CG G 
CGGC GA CCC CG AGATC GTGAT GC ACT CC TT CAACTGCGGC GG CGAGT TC T 
TCTA CT GCAAC ACC AC CCAGC TG TT CAA CT CCACC TGGAACGACAAC GG C 
AC CT GGAAC AACAC CAAGGAC AAGAACA CCATC AC CCTGC CCTGC CG CAT 
CAAG CAGAT CA TCAAC ATGTG GC AG GAG GT GGG CAAG GCC AT GTA CG CC C 
CC CC CATCC GC GGC CAGATC C GC TG CTC CT CCAAC AT CAC CG GCC TG CT G 
CTGAC CCGCGAC GGC GG CAACAAC AAC AACG AC AC CGAGA TCTTC CG CC C 
CGGC GGCGG CG ACATG CGCGACAAC TGG CG CTC CG AG CTG TACAAGT AC A 
AG GT GGTGAAG ATC GAGC CCC TG GG CGT GG CCC CC AC CAA GG CCAAG CG C 
CG CG TGGTG CAGCG CG AGAAG CG CG CCG TG GG CAT CGGCG CC ATG TT CC T 

GGGCTTCCTGGGCGCCGCCGGCTCCACCATGGGCGCCGCCTCCATGACCC 
T GAC CG TG CAG GC CCG CC AG CTG CT GTC CG GC ATC GTGCAGC AGC AG AA C 
AACC TG CTG CG CGC CATC GAG GC CCAGC AG CAC CT GCTGC AG CTGAC CG T 
GTGG GG CAT CAAGC AG CTGCAGG CC CGC GT GCT GG CCGTGGAGCG CT AC C 
TG AAGG ACC AG CAG CT GC TGG GC AT CTG GG GCTGC TC CGG CAAGC TG AT C 
TG CACC ACC AC CGTGC CCTGG AA CG CCT CC TGGTC CAACAAG TCC CTGGA 

CGAGATCTGGGA CAACATGACCTGGATGGAGTGGGAGCGC GAGAT CGACA 
ACTACACCT CC CTGAT CTACACC CTGAT CG AGG AG TC CCAGA ACC AG CAG 
GAGAAGAAC GAGCAGG AG CTG CT GG AGC TG GAC AAGTGGG CC TCC CT GT G 
GAACTGGTT CG ACATC AC CAA CTGG CTG TG.GTACATCAAGAT CTT CATCA 
TGAT CG TGG GC GG C CT GATCG GC CT GCG CATCG TG TT CGC CG TGC TG TC C 
ATCGTG AA CC GCG TG CGC CAGGGCTAC TCC CC CCTGTCCT TC CAGAC CCG 
CCTG CC CGC CC CCCGCGGCCC CG AC CGC CC CGAGGGCATC GAGGAGGAGG 
GC GG CG AGC GC GAC CG CGACC GC TC CGG CC GCC TG GT GGA CG GCT TC CT G 
GC CC TGATC TG GGA CG AC CTG CG CT CCC TG TGC CT GT TCT CC TAC CA CC G 
CC TG CG CGA CC TG C TG CTGAT CG TGACC CG CAT CG TG GAG CT GCT GG GC C 
GC CG CGG CT GG GAG GT GCTGAAGTAC TG GTG GAAC CTGCT GC AGT AC TG G 
TCCC AGGAG CTGAAGAACTCCGC CGTGT CC CTGCT GAACG CCACC GC CAT 
CG CC GT GGC CG AGG GC AC CGA CC GC GTGAT CGAGG TGGTG CAGCG CG CC T 

GCCGCGCCATCCTGCACATCCCCCGCCGCATCCGCCAGGGCCTGGAGCGC 
GC CC TGC TG TAA 
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Fig. 21 
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Expression of subtype B consensus env and gag genes in 293T cells. 
Plasmids containing codon-bptimized subtype B consensus gp160 gp140 and gag 
genes were transfected into 293T cells, and protein expression was examined by 
Western Blot analysis of cell lysates . 48-hours post-transfection, cell lysates were 
collected, total protein content determined by the BCA protein assay and 2 ug of 
total protein was loaded per lane on a 4-20% SDS-PAGE gel. Proteins were 
transferred to a PVDF membrane and probed with serum from an HIV-1 subtype B 
infected individual. 
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Fig. 22 
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Co-receptor usage of subtype B consensus envelopes. 

Pseudotyped particles containing the subtype B consensus gp160 Env 
were incubated with DEAE-Dextran treated JC53-BL cells in the 
presence of AMD3100 (a specific inhibitor of CXCR4), TAK779 (a 
specific inhibitor of CCR5), and AMD3000+TAK779 to determine co- 
receptor usage. NL4.3, an isolate known to utilize CXCR4 and YU-2, 
a known CCR5-using isolate; were included as controls. 
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Fig.26A 

Year 2000 Con-S lAOCFI.Env 

MRVRGIQRNCQHLmWGTLILGMLMICSAAENLWVTVYYGVPVWKEAOT 

NVWATHACVPTDPl^QEIVLENVTENFEMW 

TNVl^TTlWrEEKGEIKNCSFM^ 

saitqacpkvsfepipihycapagfailkcitok^ 

SIAEEEIIIRSENITTJNAKTIIVQLiraS^ 
HCNISGTK^TLQQVAKKLREHF^TI^ 

IGNGTKNNNNTNDTITLPCRIKQIIKMWQGVGQAI^APPIEGKITCKSNI 

A gpl40 CFI is referred to HIV-1 envelope design with the cleavage-site-deleted (C). fusion-site-deleted 
Sop"imfc domTs m ' nant re9ion " de,eted » in additt °" to deletion - trans^Sne and ^ 

Fig. 26B 

Codoa-optimizad Year .2000 Con-S, 140CFI. seq 

GGAAGGAGGCCAACACCACCCTGTTCTGCGCCTCCGACGCCAAGGCCTACGA 



CCCTGTGGGACCAGTCCCTGAAGCCCTGCGTGAAGCTGAGCC^ 

accaacgtgaacgtgacc^caccaccaacaacaccgaggS 

ACGTGGTGCCCATCGACGACAACAACAACAACTGCTCCAACTACCGCCTGATCAArT'pnaa^T^ 
CGCCGGCTTCGCCATCCTGAAGTGCAACGACAAGAAGTTCAACGGCACCGGCCCC^^ 

tgtccaccgtgc^gtgcacccacggc^tcaagcccgtggtctccaSca^ 

TCCCTGGCCGAGGAGGAGATCATCATCCGCTCCGAGAACATCACCAACAACGCCAAC^ 




actccttcaaSg^S 
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Fig. 27 



Individual C56BL/6 Mouse T Cell Responses to HIV-1 Envelope Peptides 
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